Devices and methods for determining the length of biopolymers and distances between probes bound thereto

ABSTRACT

Devices and methods for detecting the length of analytes and/or sequencing analytes are provided in which two or more electrical signals are obtained as an analyte traverses a fluidic channel. Detection of the relative position of probes hybridized to a biopolymer and/or the length of the analyte (e.g., a biopolymer) does not rely on the absolute time between detection events of a given electrical signal to determine a distance associated with the biopolymer. Instead, multiple signals are obtained (e.g., as functions of time) corresponding to a plurality of detector volumes at known locations along a fluidic channel through which the biopolymer passes, and the distances are determined from the multiple signals.

RELATED APPLICATION

This application claims the benefit of priority under 35 U.S.C. §119(e) to U.S. Provisional Ser. No. 61/181,907, filed May 28, 2009, the entirety of which is incorporated herein by reference.

FIELD OF INVENTION

The present invention relates generally to biopolymer sequencing. More particularly, in certain embodiments, the invention relates to determining the length of biopolymers and the distances of probes bound to the biopolymer.

BACKGROUND

A number of different approaches for sequencing nucleic acids exist. The traditional methods are the dideoxy-chain termination method described by Sanger et al., Proc. Natl. Acad. Sci. USA, (1977) 74: 5463-67 and the chemical degradation method described by Maxam et al., Proc. Natl. Acad. Sci. USA, (1977) 74: 560-564. Of these two methods, the Sanger procedure has been the most widely used. The original Sanger method relied on radioactive labeling of the reaction products and separation of the reaction products by slab gel electrophoresis.

Both the Sanger and Maxam methods are time and labor intensive. The start of the Human Genome Project was the impetus for the development of improved, automated systems to perform Sanger sequencing. As a result, detection of fluorescence has replaced autoradiography and capillary electrophoresis has replaced the ultrathin slab gels originally used to separate reaction products. Automated sequencers have been developed and are capable of processing large numbers of samples without operator intervention.

The completion of the Human Genome Project has refocused the community on the need for new technologies that are capable of rapidly and inexpensively determining the sequence of human genomes. There has been much discussion in recent years about personalized medicine. The vision of personalized medicine involves every individual having his or her complete genome sequenced at high accuracy and using this information to guide clinical care, specifically for risk stratification of patients and pharmacogenomics.

In recent years, a number of technological advances have been developed enabling a great reduction in the cost of sequencing and substantially increasing the amount of sequence data produced. All of the sequencing methods currently available utilize optical detection for the determination of the DNA sequence. The most prevalent sequencing methods are referred to as sequencing by synthesis (SBS).

Typical embodiments of SBS consist of the stepwise synthesis of a strand of DNA that is complementary to a template sequence from the target genome to be sequenced. The SBS methods can be divided into those that are performed in batch mode and those that are performed in real-time. The batch mode processes rely on the stepwise synthesis of the new DNA strand with the limitation that the synthesis is only allowed to proceed for one nucleotide position, for one nucleotide type, or for the combination of one nucleotide position and one nucleotide type. The incorporation of the nucleotide occurs in parallel for large numbers of templates and is detected using a variety of methods.

Embodiments of the batch mode utilizing a single nucleotide type are used by Roche for pyrosequencing with the 454 platform. The Roche technology (see, e.g., Margulies et al. (2005) Nature, 437:376-380; U.S. Pat. Nos. 6,274,320; 6,258,568; 6,210,891) utilizes pyrosequencing. The method depends on several enzymes and cofactors to produce luminescence when a nucleotide is incorporated. A single nucleotide species is introduced into a large number of small reaction vessels each containing multiple copies of a single template. The incorporation of the nucleotide is accompanied by light emission. When the reaction has run to completion, the reagents are washed from the reaction volumes and a next nucleotide and required reagents are washed into the reactions. Each template is thus extended in an iterative fashion, one nucleotide at a time. Multiple incorporations of the same nucleotide require the quantitative determination of the amount of light emitted. Homopolymer tracts in templates may be difficult to accurately sequence as the incremental amount of light emitted for each subsequent position in the homopolymer becomes small compared to the total amount emitted.

In a second embodiment of the SBS method, platforms by Helicos (see, e.g., Quake et al. Proc. Nat. Acad. Sci. USA (2003) 100: 3960-3964; U.S. Pat. Nos. 6,818,395; 6,911,345; 7,297,518; 7,462,449 and 7,501,245), Illumina (see, e.g., Bennett et al. Pharmacogenomics (2005) 6:373-382), and Intelligent Bio-Systems (see, e.g., Ju et al. Proc. Nat. Acad. Sci. USA (2006) 103:19635-19640) allow only the incorporation of a single nucleotide at each step. Template strands are attached to a solid support and a primer sequence is annealed. A polymerase is used to extend the primer to make a complement to the template. The nucleotides are derivatized such that after the incorporation of a single nucleotide, the growing strand is incapable of further extension. The nucleotides are further derivatized to make them fluorescent. In the Helicos technology, the four nucleotides are labeled with the same fluorescent tag. This requires that each nucleotide type be added separately. In contrast, the Illumina and Intelligent Bio-Systems technologies utilize four different fluorescent tags so that a mixture of all four derivatized nucleotides may be added at the same time. For both technologies, the incorporation of a nucleotide is accompanied by the appearance of fluorescence in the growing strand. In the case of Illumina, the wavelength of the fluorescence emission indicates the identity of the newly incorporated nucleotide. In the Helicos technology, only a single nucleotide type is added at each cycle. Thus, the appearance of fluorescence at a position on the solid support indicates the incorporation of the added nucleotide for that template. Templates that do not incorporate the nucleotide present in the reaction remain dark.

Following the observation of any incorporated fluorescence, the blocking groups and fluorescent tags are removed prior to the next cycle. Multiple cycles result in the acquisition of sequence data for many templates in a single run. The instrumentation typical for these technologies allows for the automated acquisition of sequence information for hundreds of thousands to millions of templates in parallel.

SBS methods may also be performed in real-time. In this embodiment, polymerase is used to incorporate fluorescently labeled nucleotides and the fluorescence is observed during DNA strand synthesis. The four nucleotides are labeled with different fluorescent tags. The fluorescent tags are attached to the terminal phosphate of the nucleotide triphosphate. During incorporation of the nucleotide into the growing strand the fluorophore is released to solution and the growing strand remains non-fluorescent. The identity of the incorporated nucleotide is determined while the nucleotide resides in the active site of the enzyme and before the cleaved diphosphate is released to bulk solution.

The fluorescence of the incorporated nucleotide typically is measured against background fluorescence from a much larger concentration of unincorporated nucleotide. Pacific Biosciences (see, e.g., U.S. Pat. Nos. 7,170,050; 7,302,146; 7,315,019; 7,476,503; and 7,476,504) identifies the incorporated nucleotide based on the residence time in the polymerase active site. Fluorescence emission from the active site for an appropriate time indicates incorporation and the emission wavelength determines the identity of the incorporated nucleotide. Polymerase is attached to the bottom of zero-mode waveguides. Zero-mode waveguides are reaction cells whose dimensions limit the fluorescence excitation to the evanescent wave from the light source. Thus, only fluorescent tags close to the bottom surface of the reaction volume are excited.

Visigen identifies the incorporated nucleotide through Fluorescence Resonance Energy Transfer (FRET) between an acceptor in the polymerase active site and a fluorescent tag on the nucleotide (see, e.g., U.S. Pat. Nos. 7,211,414 and 7,329,492). Only nucleotides held in the active site of the polymerase show fluorescence. Incorporation is identified by the residence time of the fluorescence in the active site and the nucleotide identity is determined by the emission wavelength.

Other recently developed methods to sequence DNA rely on hybridization and ligation. Both the SOLiD and Complete Genomics technologies rely on the combination of hybridization and ligation. The SOLiD system (Life Technologies) immobilizes short template strands via an adaptor. A primer and a pool of labeled oligonucleotides, each containing two fixed positions and six degenerate positions, are hybridized to the template. The primer hybridizes to the adaptor. The pool consists of 65,536 (4^8) different sequences. Four fluorescent dyes are used to label the oligonucleotides in a fashion that creates four subsets based on the sixteen combinations at the two fixed positions. Thus, each fluorescent tag is associated with four of the sixteen possible combinations. Following hybridization, a ligase is added and any probes in the pool that hybridized contiguously with the primer are ligated to the primer. The fluorescence of the hybridized and ligated product is determined. The fluorescence defines which subset of sequences hybridized to the template and ligated to the primer. The terminal three bases and the associated fluorescent tag are cleaved from the hybridized and ligated oligonucleotide. Subsequent rounds of hybridization, ligation, and cleavage are then performed. In this first series of reactions, each cycle identifies a subset for the pair of nucleotides in the template that is 5 nucleotides downstream from the subset of 4 pairs that was identified in the previous cycle. After several cycles, the primer, and the oligonucleotides that have been ligated to it, are washed off the template.

The entire procedure is repeated starting with a primer that is one nucleotide shorter than the original primer, then with primers that are two, three, and four nucleotides shorter than the original primer. These subsequent rounds shift the frame of interrogation so that the bases that make up the template strand can be identified from the union between the two subsets of reactions that overlapped at that position.

Complete Genomics technology utilizes a similar hybridization and ligation method (see, e.g., US Patent Application Publication Nos. 20080234136; 20090005252; 20090011943; and 20090176652). In the Complete Genomics technology, a primer is hybridized to an adaptor that is attached to the end of the template. A series of pools of oligonucleotides is constructed. In each pool, the nucleotide at a single position is identified by using four-color fluorescence. The remaining positions are degenerate. The first pool is hybridized to the template. Oligonucleotides that hybridize adjacent to the primer are subsequently ligated. After washing excess oligonucleotides away, the fluorescence of the ligated oligonucleotide identifies the nucleotide at the defined position in that pool. The ligated primer and oligonucleotide are washed off the template and the process is repeated with the next pool of oligonucleotides that probe the next position down from the primer.

The SBS and hybridization-ligation methods generate short pieces or reads of DNA sequence. While the short reads can be used to re-sequence human genomes, they are not favorable for the de novo assembly of human genomes. With the recent realization that human genomes contain large numbers of inversions, translocations, duplications, and indels (e.g., mutations that include insertions, deletions, and combinations thereof), the quality of human genome data from short reads is even more suspect. Genetic rearrangements are even more prevalent in cancer.

While embodiments of the short read technologies that incorporate paired-end reads have been proposed and the length of the sequence data from these technologies has increased incrementally over the last two years, it is clear that longer read technologies are necessary for the accurate assembly of human genome data.

In addition to the undesirable nature of short reads, all of the extant DNA sequencing methods employ optical detection. The throughput of optical methods limits the ultimate performance characteristics of any of these sequencing technologies. Optical methods are capable of identifying single molecules. However, the time required to observe and accurately identify events will remain too slow to meet the need for higher throughput. While the current generation of sequencing technologies has lowered the cost of sequencing by orders of magnitude as compared to the methods used to sequence the first human genomes, the methods remain too slow and costly for routine analysis of human genomes.

A need therefore exists for efficient methods and devices capable of rapid and accurate nucleic acid sequencing for de novo assembly of human genomes. It is desirable to have long read lengths and to use as little nucleic acid template as possible. Moreover, single-molecule optical detection of DNA has limitations with respect to sensitivity and speed.

The use of electronic detection applied to DNA sequencing may help overcome the limitations associated with single-molecule detection. For example, Hybridization-Assisted Nanopore Sequencing (HANS), which uses nanopores to detect and locate the position of hybridization events (e.g., hybridized probes on a biopolymer), is expected to yield highly accurate DNA sequence information, with long read lengths. The HANS method relies on detecting the position of hybridized probes on single molecules of the biopolymer to be sequenced or characterized. The resulting positional hybridization data is used to reconstruct sequence information of the target strand. The process for sequence reconstruction is similar to that for reconstructing sequence data from Sequencing by Hybridization (SBH) experiments with the important difference that the addition of positional information removes the inherent mathematical limitations of SBH and results in successful reconstruction of extremely long sequences.

The HANS method provides a number of benefits over other proposed sequencing technologies. For example, the inherent nature of reconstructing data from multiple overlapping hybridization events reduces errors. Further, the rapid nature of the sensing allows for higher accuracy since coverage can be extensive without significantly impacting the timely production of data. Thus, a significant benefit of the HANS approach is the long read lengths obtainable by the method, which may be used to identify genomic rearrangements and/or reconstruct haplotypes from diploid organisms or separate genomes of related mixtures of, for instance, viral or microbial species.

In the HANS method, two reservoirs of solution are separated by a nanometer-sized hole, or nanopore, that serves as a fluidic constriction of known dimensions. The application of a constant DC voltage between the two reservoirs results in a baseline ionic current that is measured. If an analyte is introduced into a reservoir, it may pass through the fluidic channel and change the observed current, due to a difference in conductivity between the electrolyte solution and analyte. The magnitude of the change in current depends on the volume of electrolyte displaced by the analyte while it is in the fluidic channel. The duration of the current change is related to the amount of time that the analyte takes to pass through the nanopore constriction.

In the case of DNA translocation through a nanopore, the physical translocation is driven by the electrophoretic force generated by the applied DC voltage between the two reservoirs. This driving force and the detected signal are, typically, inseparably coupled. A higher signal-to-noise ratio may be obtained by using higher voltages, but this may also result in a faster translocation rate of the analyte through the nanopore. The faster translocation rate may reduce the duration of the current change when analyte passes through the pore and thus the current change may be harder to detect because of bandwidth limitations in the current sensing electronics.

DNA can also be translocated through nanochannels by applying a DC voltage. See, e.g., Riehn, R. et al. Proc. Nat. Acad. Sci. 2005, 102, 10012, which is incorporated herein by reference in its entirety. Detection of DNA molecules in a nanochannel has been accomplished by applying a current through electrodes that are perpendicular to the nanochannel. See Liang, X.; Chou, S. Y. Nano Lett. 2008, 8, 1472, which is incorporated herein by reference in its entirety. As DNA passes between the electrodes, the observed current passing between two electrodes disposed on opposite sides of the channel changes. The length of the DNA strand may be inferred from the time of passage of the strand past the electrodes. However, the spread in the data indicates that there may be significant error in the calculation of the length of the DNA if one only uses the duration of the signal.

As discussed above, the distance between locations of hybridization of sequence selective probes in the HANS method is inferred from the time between translocations of the hybridized portions through the nanopore. While extremely sensitive as single-molecule detectors, solid-state nanopores have a number of inherent limitations for the characterization of DNA strands. Translocation times are rapid through nanopores thus necessitating a tagging scheme for probes. The design of the nanopore generally precludes multiple measurements of a single molecule unless a capture-recapture technique is utilized (Gershow, M.; Golovchenko, J. A. Recapturing and trapping single molecules with a solid-state nanopore. Nature Nanotech 2007, 2, 775-779, which is incorporated by reference in its entirety). The translocation times of DNA fragments, all of the same length, have relatively large distributions. While each of these problems can be resolved for the implementation of HANS with nanopore detectors, it is apparent that improvements in the detector will make HANS development faster and will lead to higher sequencing accuracy and throughput.

Another limitation to solid-state nanopores is the fabrication of devices. Currently, each pore is fabricated by using a transmission electron microscope (TEM). From both a time and cost standpoint, this method would be prohibitive for the construction of large arrays of nanopores.

An alternative detector in a nanochannel utilizes a 4-point sensing element to separate the detection element and the electrophoretic driving elements. This detector also infers the relative positions of hybridized probes from the time between passage of subsequent probes through the detector. However, the difficulty of determining the biopolymer's passage rate lowers the resolution of sequencing data.

Thus, there remains a need for improved devices and methods for sequencing biopolymers.

SUMMARY

Embodiments of the present invention provide improved devices and methods for determining the length and sequences of analytes (e.g., biopolymers) by detecting two or more electrical signals from an analyte as the analyte traverses a fluidic channel without having to rely on an absolute correlation between time and position. Two or more electrical signals are detected by disposing a plurality of sensing electrodes along a length of a fluidic channel. The detection of two or more electrical signals from separate sensing electrodes enables more accurate determination of analyte length and sequence. Thereby, the distance between hybridization events on a target biopolymer can be more accurately determined by using coincident detection events. The more accurate determination of distance between hybridizing probes enables more accurate sequencing by the HANS method.

More particularly, embodiments of the present invention may utilize multiple pairs of nanoscale electrodes for electronic sensing of analytes, e.g., DNA in fluidic channels. The sensing elements in the fluidic channel may be used to determine the length of the analyte or they may be used to determine the distance between probes hybridized to a target strand of DNA. The device design is similar to nanochannel devices used for optical detection. Two micro-scale liquid reservoirs may be fabricated at a distance of 100 to 200 μm. One or more fluidic channels may connect the two reservoirs. A cap may be fabricated by drilling holes that will allow fluid introduction to each reservoir and to provide access for macroscopic electrodes. Along the length of the fluidic channels, a series of sensing elements are fabricated. Each sensing element may be composed of two electrodes that bisect the fluidic channel. In use, a voltmeter may be used to monitor the potential difference for each pair of sensing electrodes.

The DNA to be analyzed may be introduced to one of the microfluidic reservoirs. Macroscopic electrodes may be connected to a power supply and used to apply a potential between the two reservoirs. DNA fragments may be electrophoretically driven from the microscopic reservoir into the nanochannels. As each DNA fragment moves down the fluidic channel, it may enter and exit each of the sensing elements disposed in the fluidic channel.

In the absence of DNA, the detector volumes contain only the ionic solution and typically have a baseline potential difference measured between the two electrodes that make up the sensing element. As DNA enters the detector volume, the potential measured between the two sensing electrodes may change because the DNA has a conductivity different from that of the ionic solution. When DNA enters the detector volume, the conductivity of the channel between the two sensing electrodes will typically be reduced as DNA is less conductive than the buffer solution (See de Pablo, P. J.; Moreno-Herrero, F; Colchero, J.; Gomez-Herrero, J.; Herrero, P.; Baro, A. M.; Ordejon, P.; Soler, J. M.; Artacho, E. Absence of dc-Conductivity in λ-DNA. Phys. Rev. Lett. 2000, 85, 4992-4995, which is incorporated by reference in its entirety). When a portion of the DNA that has a probe hybridized to the DNA enters the sensing volume the potential may change further.

The DNA fragment has two or more positions where probes are hybridized. As the DNA moves down the fluidic channel, the locations where the probes are hybridized to the DNA also move through each of the detector volumes in turn. Each sensing element is monitored and the potential difference between the two sensing electrodes making up the sensing element is recorded. When two or more probes, located at different positions on the target DNA, reside in different detector volumes at the same time, the distance between the probes can be determined. The distance between any two probes may be determined by noting which pair of detector volumes shows detection for that pair of probes at the same time, i.e., coincident detection. Appropriate spacing of the sensing electrodes and sensing elements increases the likelihood that a pair of probes shows coincident detection in two detector volumes.
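As an illustration only, the coincident-detection idea described above can be sketched in Python. The function name, the detector positions, and the per-sample probe flags are hypothetical and are not part of the disclosure; the sketch assumes each sensing element's recorded signal has already been reduced to a probe-present flag for every time sample.

```python
from itertools import combinations

def coincident_probe_distances(positions_um, probe_flags):
    """Return the detector separations (micrometers) at which two detector
    volumes reported a probe during the same time sample.

    positions_um: known position of each detector volume along the channel.
    probe_flags:  one list per detector; probe_flags[d][t] is True when
                  detector d senses a probe at sample t (assumed input).
    """
    n_samples = len(probe_flags[0])
    distances = set()
    for t in range(n_samples):
        # Detectors that see a probe at this instant.
        active = [d for d in range(len(positions_um)) if probe_flags[d][t]]
        # Every coincident pair yields one candidate probe-to-probe distance.
        for a, b in combinations(active, 2):
            distances.add(abs(positions_um[b] - positions_um[a]))
    return distances

# Hypothetical example: three detectors at 0, 1 and 3 micrometers.
positions = [0.0, 1.0, 3.0]
flags = [
    [False, True, False, False],   # detector 0 sees a probe at sample 1
    [False, False, False, False],  # detector 1 never sees a probe
    [False, True, False, False],   # detector 2 sees a probe at sample 1
]
result = coincident_probe_distances(positions, flags)
print(result)  # {3.0}
```

Note that no velocity or absolute time enters the calculation; only the known detector positions and the fact of simultaneity are used.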

The error in the determination of the length of the analyte is dependent on the accuracy to which the distances between sensing electrodes are known or can be measured, either directly or through a calibration step. The determination of the distances between hybridization events relies on knowledge of the distance between sensing electrodes and does not require that the biopolymer move with constant velocity through the device. The spacing between sensing electrodes may be chosen to vary in a fashion that enables the determination of a variety of different distances.
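The benefit of varied spacing can be shown with a small, hypothetical Python sketch. The positions below are arbitrary illustrative values, not dimensions taken from the disclosure; the function simply lists every separation that some pair of detector volumes can report by coincident detection.

```python
from itertools import combinations

def measurable_separations(positions_um):
    """Sorted list of distinct pairwise separations between detector positions."""
    return sorted({abs(b - a) for a, b in combinations(positions_um, 2)})

# Four equally spaced detectors resolve only three distinct distances ...
uniform = measurable_separations([0, 2, 4, 6])
# ... whereas four unequally spaced detectors can resolve six.
varied = measurable_separations([0, 1, 4, 6])

print(uniform)  # [2, 4, 6]
print(varied)   # [1, 2, 3, 4, 5, 6]
```

With the same number of sensing elements, the unequal layout in this example covers twice as many distinct probe-to-probe distances.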

If two probes do not fill different detector volumes at exactly the same time, the time between the exit of one probe from a detector volume and the entrance of a different probe to the detector volume can be used to make adjustments to the distance between the two affected sensing elements to determine the distance between probes. In this case, the time used to correct the distance information should be small and will not add significant error to the distance measurement. Again, it is important to choose spacing for the sensing electrodes such that errors are reduced.
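The text does not give a formula for this adjustment, so the following Python sketch is only one plausible reading. It assumes, hypothetically, that a local velocity estimate is available (for example from the time a probe takes to cross a detector volume of known length) and that the velocity is roughly constant over the short interval being corrected. All names and values are illustrative.

```python
def corrected_probe_distance(spacing_um, t_lead_s, t_trail_s, local_velocity_um_per_s):
    """Estimate the distance between two probes that were detected almost,
    but not exactly, at the same time.

    spacing_um: known distance between the two sensing elements.
    t_lead_s:   time the leading probe enters the downstream detector.
    t_trail_s:  time the trailing probe enters the upstream detector.
    local_velocity_um_per_s: assumed short-term velocity estimate.
    """
    # A positive delay means the trailing probe was still upstream of its
    # detector when the leading probe arrived, so the probes are farther
    # apart than the electrode spacing; a negative delay means closer.
    delay_s = t_trail_s - t_lead_s
    return spacing_um + local_velocity_um_per_s * delay_s

# Hypothetical example: 2 um spacing, 2 ms delay, 100 um/s local velocity.
estimate = corrected_probe_distance(2.0, 0.010, 0.012, 100.0)
print(round(estimate, 3))  # 2.2
```

Because the correction term scales with the delay, keeping the delay small (through suitable electrode spacing) limits the error contributed by any inaccuracy in the velocity estimate.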

In an aspect, the invention features a device for determining the length of an analyte by detecting electrical signals. The device may comprise a fluidic channel and a plurality of sensing electrodes disposed along a length of the fluidic channel. A fluidic channel may be defined in a substrate. A plurality of sensing electrodes may be disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel. The sensing electrodes may be configured for connection to a measurement tool for capturing the electrical signals corresponding to the detector volumes. The fluidic channel may include or consist essentially of a nanochannel or a microchannel. The relative positions of the sensing electrodes may be known. The captured electrical signals in conjunction with the relative positions of the sensing electrodes may indicate the length of the analyte.

One or more of the following features may be included in a device for determining a length of an analyte by detecting electrical signals. A data collection device may be configured for recording electrical signals captured by the measurement tool as a function of time. A computer may be in electrical communication with the data collection device, the computer programmed to determine which detector volumes record a change in electrical signal at the same time. An electronic circuit may be configured to output a signal only when two electrical signals corresponding to two detector volumes change at the same time.
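A software analogue of the computer and coincidence circuit described above might look like the following Python sketch. The trace values, baselines, and threshold are hypothetical, and the sketch assumes a signal counts as "changed" when it deviates from its baseline by more than a fixed threshold.

```python
def changed_detectors(traces, baselines, threshold):
    """For each time sample, list the detectors whose signal deviates from
    its baseline by more than the threshold."""
    n_samples = len(traces[0])
    return [
        [d for d, trace in enumerate(traces)
         if abs(trace[t] - baselines[d]) > threshold]
        for t in range(n_samples)
    ]

def coincidence_gate(traces, baselines, threshold):
    """True only at samples where at least two detectors change together."""
    return [len(active) >= 2
            for active in changed_detectors(traces, baselines, threshold)]

# Hypothetical recorded potentials (arbitrary units) for two detectors.
traces = [
    [100, 100, 90, 90],
    [100, 100, 100, 91],
]
baselines = [100, 100]

changed = changed_detectors(traces, baselines, threshold=5)
gate = coincidence_gate(traces, baselines, threshold=5)
print(changed)  # [[], [], [0], [0, 1]]
print(gate)     # [False, False, False, True]
```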

A pair of electromotive electrodes may be disposed at a first end and a second end of the fluidic channel. The pair of electromotive electrodes may include macroscopic electrodes arranged to generate a constant, changing, or oscillating electrophoretic force in the fluidic channel for translocation of the analyte disposed therein.

The device may be configured such that positive pressure and/or a chemical gradient drives the analyte through the fluidic channel.

The substrate may include or consist essentially of silicon, silicon dioxide, fused silica, and/or gallium arsenide. At least one of the sensing electrodes may include or consist essentially of platinum, gold, chrome, titanium, silver chloride, silver, and/or graphene.

A sensing element corresponding to a given detector volume may include two sensing electrodes disposed on opposing sides of the fluidic channel and/or two sensing electrodes disposed on a first side of the fluidic channel and/or two sensing electrodes transversing the fluidic channel. A sensing element corresponding to a given detector volume may include a first sensing electrode transversing the fluidic channel, and a second sensing electrode on a side of the fluidic channel.

The measurement tool may include at least one of a voltmeter, an ammeter, or a field-effect transistor.

A plurality of fluidic channels may be defined in the substrate. A voltage amplifier may be configured to amplify the two or more electrical signals.

The fluidic channel may have a width selected from a range of 1 nm to 5 μm and/or a depth selected from a range of 1 nm to 5 μm and/or a length selected from a range of 1 μm to 10 cm.

The device may include at least three pairs of sensing electrodes, and a distance between detector volumes defined by a first and second pair of sensing electrodes may be unequal to a distance between detector volumes defined by the second and third pair of electrodes.

The device may comprise at least three pairs of sensing electrodes, in which a detector volume defined by a first pair of sensing electrodes may be unequal to a detector volume defined by a second pair of sensing electrodes, and a detector volume defined by the second pair of sensing electrodes may be unequal to a detector volume defined by a third pair of sensing electrodes.

Each detector volume may be associated with one or more sensing electrodes.

The description of elements of the embodiments of other aspects of the invention may be applied to this aspect of the invention as well.

In another aspect, the invention features a method for determining a length of an analyte. The method includes disposing the analyte in a fluidic channel. A potential is applied along the fluidic channel. The analyte is translocated from a first end of the fluidic channel to a second end of the fluidic channel. Two or more electrical signals are detected as the analyte moves through the fluidic channel, the two or more electrical signals corresponding to two or more detector volumes of the fluidic channel. The two or more electrical signals are detected using a plurality of sensing electrodes disposed along the length of the fluidic channel. The length of the analyte is determined by analyzing the two or more detected electrical signals. The fluidic channel is a nanochannel or a microchannel.

One or more of the following features may be included in a method for determining the length of an analyte. Applying the potential along the fluidic channel may generate an electrophoretic force therein. The analyte may be translocated by using a chemical gradient and/or a pressure differential.

Determining the length of the analyte may include identifying at least two detector volumes in which the analyte is sensed at a given time, and determining a distance between sensing electrodes corresponding to the at least two detector volumes.
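One way to picture this step is the Python sketch below. It is a simplified illustration under stated assumptions: detector volumes are treated as points at known positions, the occupancy flags for a single instant are hypothetical inputs, and the function name is not from the disclosure.

```python
def length_bounds(positions_um, occupied):
    """Bound the analyte length from one snapshot of detector occupancy.

    positions_um: known detector positions, in increasing order.
    occupied:     occupied[d] is True if the analyte is sensed in detector d.
    Returns (lower, upper); upper is None when the analyte reaches the first
    or last detector, so no empty neighbor is available to bound it.
    """
    idx = [i for i, occ in enumerate(occupied) if occ]
    if len(idx) < 2:
        raise ValueError("need at least two occupied detector volumes")
    first, last = idx[0], idx[-1]
    # The analyte spans at least the distance between the outermost
    # occupied detectors.
    lower = positions_um[last] - positions_um[first]
    # It reaches neither empty neighbor, so it is shorter than the distance
    # between them.
    if first > 0 and last < len(positions_um) - 1:
        upper = positions_um[last + 1] - positions_um[first - 1]
    else:
        upper = None
    return lower, upper

# Hypothetical snapshot: detectors at 0, 1, 3, 6, 10 um; the middle three occupied.
bounds = length_bounds([0, 1, 3, 6, 10], [False, True, True, True, False])
print(bounds)  # (5, 10)
```

Repeating the calculation for successive snapshots as the analyte advances would tighten the interval, since different detector pairs bracket the analyte at different times.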

An amount of analyte that partially fills the detector volume may also be determined by comparing the electrical signal caused by the analyte to a maximum signal caused by a sample biopolymer long enough to fill the detector volume entirely.
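The comparison described above can be sketched as follows. The sketch assumes, as a simplification not stated in the text, that the signal change scales linearly with the portion of the detector volume occupied; the function name and numbers are hypothetical.

```python
def filled_fraction(signal, baseline, full_signal):
    """Estimate what fraction of a detector volume the analyte occupies.

    signal:      measured signal with the analyte partially in the volume.
    baseline:    signal with only ionic solution present.
    full_signal: calibration signal from a sample biopolymer long enough to
                 fill the detector volume entirely.
    Assumes a linear response between baseline and full_signal.
    """
    span = full_signal - baseline
    if span == 0:
        raise ValueError("calibration signal equals baseline")
    fraction = (signal - baseline) / span
    # Clamp to [0, 1] so noise cannot yield an unphysical fraction.
    return min(1.0, max(0.0, fraction))

# Hypothetical values: baseline 100, fully filled 80, measured 95.
fraction = filled_fraction(95, 100, 80)
print(fraction)  # 0.25
```

Multiplying the fraction by the known length of the detector volume would give the length of analyte inside it, which can then be added to the electrode-spacing distance.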

A correction factor may be applied to a measured length to determine an actual length of the analyte. The analyte may include a biopolymer such as deoxyribonucleic acids, ribonucleic acids, and/or polypeptides. The biopolymer may be a single-stranded molecule. A portion of the analyte may be at least partially hybridized, such that the detected electrical signals indicate the presence of a probe bound to the analyte.

The description of elements of the embodiments of other aspects of the invention may be applied to this aspect of the invention as well.

In yet another aspect, the invention features a method for determining a sequence of a biopolymer, including preparing an analyte by hybridizing a first plurality of probes, each with specificity for the same sequence on the biopolymer, with the biopolymer such that the first plurality of probes attaches to portions of the biomolecule to produce a partially hybridized biomolecule.

The analyte may be disposed in a fluidic channel. A potential may be applied along the fluidic channel. The analyte may be translocated from a first end of the fluidic channel to a second end of the fluidic channel. Two or more electrical signals may be detected as the analyte moves through the fluidic channel, the two or more electrical signals corresponding to two or more detector volumes of the fluidic channel. The two or more electrical signals may be detected by using a plurality of sensing electrodes disposed along the length of the fluidic channel, the detected electrical signals indicating locations of the hybridized probes along the biopolymer. The electrical signals may be analyzed to determine in which detector volumes probes bound to the biomolecule are located. At least a portion of the sequence of the biopolymer may be determined using a distance between sensing electrodes corresponding to the detector volumes in which the probes are located. The fluidic channel may be a nanochannel or a microchannel.

One or more of the following features may be included. Applying the potential along the fluidic channel may generate an electrophoretic force therein. The analyte may be translocated by using a chemical gradient and/or a pressure differential. A distance between probes may be determined by using a coincident response of electrical signals corresponding to two or more detector volumes. A spacing between sensing electrodes may be used to determine a maximum and/or minimum distance between probes.
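
The minimum and maximum probe distances implied by a coincident response may be sketched as follows, assuming that two probes are sensed at the same instant in two detector volumes whose positions along the channel are known. The function name and the interval representation are hypothetical.

```python
from typing import Tuple

Interval = Tuple[float, float]  # (start, end) of a detector volume along the channel, in nm


def probe_distance_bounds(volume_a: Interval,
                          volume_b: Interval) -> Tuple[float, float]:
    """Bound the distance between two probes sensed coincidentally.

    Each probe lies somewhere inside its detector volume, so the distance is
    at least the gap between the volumes and at most their outer extent.
    """
    near, far = sorted([volume_a, volume_b])
    minimum = max(0.0, far[0] - near[1])
    maximum = far[1] - near[0]
    return minimum, maximum


if __name__ == "__main__":
    # Hypothetical detector volumes, 10 nm long, 200 nm apart center to center.
    print(probe_distance_bounds((100.0, 110.0), (300.0, 310.0)))
```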

The electrical signal may initially change when the biopolymer moves through a detector volume associated with two sensing electrodes and further change when a portion of the biopolymer including a hybridized probe moves through the detector volume.

Determining the sequence of the biopolymer may include using a computer algorithm to process the two or more electrical signals.

A second plurality of probes may be hybridized with the biopolymer either subsequent to, or in parallel with, the hybridization of the first plurality and the detecting, analyzing, and determining steps may be repeated with the second plurality of probes.

The electrical signals may be used to detect and record complexed and uncomplexed regions of the biopolymer to create a first probe map of the first plurality of probes and a second probe map of the second plurality of probes, the first probe map and the second probe map respectively including information about the relative positions of the hybridized first and second plurality of probes.

A more efficient sequencing embodiment utilizes pools of probes rather than a single probe. Thus, the biopolymer to be sequenced may be hybridized with a collection of two or more probes each having a known and different specificity for a sequence on the biopolymer. The different probes may be tagged in a fashion that allows the devices disclosed herein to determine which probe is present on the biopolymer at each instance of hybridization. Alternatively, the identity of the probe present at each instance of hybridization may be unknown. The identity of one or more of the probes in the pool may then be determined subsequently by hybridizing a probe that has an overlapping or otherwise related sequence specificity.

For example, a pool containing two probes with sequences ACTGCC and TAAGTC may be hybridized with a target and the distances between all instances of hybridization determined. Subsequently a pool of probes consisting of CTGCCA, CTGCCC, CTGCCG, and CTGCCT may be hybridized with the same target sequence. The second pool of probes may hybridize to all of the positions that were hybridized by the probe ACTGCC in the first pool, as well as to additional sequences. The hybridization of both pools in separate experiments may allow a high confidence determination of which hybridization event was related to each of the probes in the first pool. Other pooling schemes may be envisaged by those skilled in the art.
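
The pooling example above may be illustrated with a short simulation. This is a sketch only: it uses a hypothetical target sequence that is known in advance, represents hybridization as exact matching of the probe sequence against the target (ignoring complementarity and mismatches), and expresses positions as base indices rather than measured distances. All function names are assumptions.

```python
from typing import List


def shifted_pool(probe: str, alphabet: str = "ACGT") -> List[str]:
    """Probes overlapping `probe` by all but its first base, e.g. ACTGCC -> CTGCCA, ..."""
    return [probe[1:] + base for base in alphabet]


def find_sites(target: str, probe: str) -> List[int]:
    """All start indices at which `probe` matches `target` exactly."""
    sites, start = [], target.find(probe)
    while start != -1:
        sites.append(start)
        start = target.find(probe, start + 1)
    return sites


def assign_sites(target: str, probe: str, first_pool_sites: List[int]) -> List[int]:
    """First-pool sites attributable to `probe`: those followed, one base
    downstream, by a hit from the shifted second pool."""
    second_sites = set()
    for candidate in shifted_pool(probe):
        second_sites.update(find_sites(target, candidate))
    return [site for site in first_pool_sites if site + 1 in second_sites]


if __name__ == "__main__":
    target = "GGACTGCCATTTAAGTCAA"  # hypothetical target sequence
    first_pool = ["ACTGCC", "TAAGTC"]
    sites = sorted(s for p in first_pool for s in find_sites(target, p))
    print(shifted_pool("ACTGCC"))
    print(assign_sites(target, "ACTGCC", sites))
```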

A candidate sequence may be determined by ordering at least two probe map sequences using at least one probe map of positional information or a combination of overlapping probe map sequences and positional information. The first and second probe maps may include information about an error of the positional information for each probe.

A candidate sequence may be determined by ordering at least two probe sequences using at least one of (i) positional information and parameters relating to the error in positional information or (ii) a combination of overlapping sequences of the probe molecules and positional information and error in positional information.

The biopolymer may include a double-stranded biopolymer target molecule.

Preparing the analyte may include contacting the biopolymer with a first probe having a first probe specificity for recognition sites of the biopolymer to form a first plurality of local ternary complexes. The first probe may have a first known recognition site sequence. The electrical signals may be used to determine positional information of the first plurality of local ternary complexes. Preparing the analyte may include contacting the biopolymer with a second probe having a second probe specificity for recognition sites of the biopolymer to form a second plurality of local ternary complexes. The second probe may have a second known recognition site sequence different from the first known recognition site.

Positional information of at least the first and second pluralities of local ternary complexes may be aligned to determine a sequence of the biopolymer.

The description of elements of the embodiments of other aspects of the invention may be applied to this aspect of the invention as well.

In yet another aspect, embodiments of the invention include a system for determining a length of an analyte, the system including an analyzing module that determines the length of the analyte based at least in part on a plurality of electrical signals captured by the devices described above.

The description of elements of the embodiments of other aspects of the invention may be applied to this aspect of the invention as well.

In another aspect, embodiments of the invention include an apparatus for determining a length of an analyte. The apparatus may include (a) a memory that stores code defining a set of instructions; and (b) a processor that executes the instructions thereby to determine the length of the analyte from two or more detected electrical signals captured by the devices described above.

The description of elements of the embodiments of other aspects of the invention may be applied to this aspect of the invention as well.

In another aspect, embodiments of the invention include a system for sequencing a biopolymer. The system may include (a) a fluidic channel defined in a substrate; and (b) a plurality of sensing electrodes disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel. The fluidic channel may be configured such that a biopolymer with at least a first plurality of probes attached thereto may pass therethrough. The sensing electrodes may be configured for connection to a measurement tool for capturing the electrical signals corresponding to the detector volumes as the biopolymer passes through the fluidic channel. The fluidic channel may be a nanochannel or a microchannel, and relative positions of the sensing electrodes may be known. The system may also include an analyzing module that determines at least a portion of the sequence of the biopolymer based at least in part on a plurality of the captured electrical signals. The analyzing module may be configured to perform one or more steps of the method described above.

In still another aspect, embodiments of the invention may include an apparatus for sequencing a biopolymer. The apparatus may include (a) a fluidic channel defined in a substrate; and (b) a plurality of sensing electrodes disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel. The fluidic channel may be configured such that a biopolymer with at least a first plurality of probes attached thereto may pass therethrough. The sensing electrodes may be configured for connection to a measurement tool for capturing the electrical signals corresponding to the detector volumes as the biopolymer passes through the fluidic channel. The fluidic channel may be a nanochannel or a microchannel. The relative positions of the sensing electrodes may be known. The captured electrical signals in conjunction with the relative positions of the sensing electrodes indicate at least a portion of the sequence of the biopolymer.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a schematic diagram illustrating a longitudinally displaced transverse electrode device configuration;

FIG. 2 is a schematic diagram illustrating a longitudinally displaced transverse electrode device configuration;

FIG. 3 is a schematic diagram illustrating a longitudinally displaced continuous transverse nanoscale electrode device configuration;

FIG. 4 is a schematic diagram illustrating a longitudinally displaced nanoscale electrode device configuration with electrodes disposed on the same side of a channel;

FIG. 5 is a schematic diagram illustrating a longitudinally displaced transverse electrode configuration with two pairs of electrodes disposed in a channel;

FIG. 6 is a schematic diagram illustrating a device configuration allowing the monitoring of changes in current, in accordance with an embodiment of the invention;

FIG. 7 a is a graph and schematic diagram illustrating a fluidic channel transversed by 6 electrodes; an electrical signal is recorded as the biopolymer enters each detector volume (i.e., a volume associated with two electrodes);

FIG. 7 b is a graph illustrating an electrical signal recorded as the biopolymer enters a detector volume;

FIG. 7 c is a graph illustrating the slope of the electrical signal that reflects first the entry of a biopolymer into a detector volume, and then the entry of a hybridized probe into the same detector volume;

FIGS. 8 a-8 i are graphs and schematic diagrams illustrating a biopolymer with hybridized probes being translocated from a first end to a second end of a fluidic channel crossed by 6 transverse sensing electrodes and the resultant electrical signals;

FIG. 9 a is a schematic diagram illustrating a fluidic channel transversed by numerous perpendicular sensing electrodes;

FIG. 9 b is a schematic diagram illustrating an arrangement of sensing electrodes that allows for measurement of small lengths (increments of 100) over total lengths of 7000;

FIG. 9 c is an illustration of a Golomb ruler of length 6;

FIG. 10 is a schematic diagram illustrating an alternative arrangement of electrodes with varying distances between electrodes;

FIGS. 11 a-11 g are graphs and schematic diagrams illustrating a biopolymer with hybridized probes being translocated in a fluidic channel with sensing electrodes having varying distances therebetween;

FIG. 12 is a schematic depiction of a DNA molecule;

FIG. 13 is a schematic depiction of an RNA molecule;

FIG. 14 is a schematic depiction of a hybridizing oligonucleotide (or probe);

FIG. 15 is a schematic depiction of a single-stranded DNA molecule hybridized with a probe;

FIG. 16 is a schematic depiction of an apparatus employed in a method of an embodiment of the present invention;

FIG. 17 is a close-up view of a hybridized biomolecule translocating through the nanopore of the apparatus in FIG. 16; and

FIG. 18 depicts the results from a repetitive application of a method of the present invention using different probes.

DETAILED DESCRIPTION

Nanoscale pores that allow passage of biomaterials have offered a tool for the analysis of biopolymers by determining the distance between hybridization sites. However, such nanopores have suffered from a lack of accuracy because the accurate determination of distance in the previously described methods relies on a constant velocity of the biopolymer through the nanopore during the measurement. Extending nanopores into fluidic channels (e.g., nanochannels) containing multiple sensing electrodes provides new uses and a new level of precision in biopolymer analysis.

The construction of a fluidic channel device incorporating a plurality of electrode pairs as voltage or current detectors along its length is described below. These electrode pairs can detect changes in the conductivity of the fluid volume between them as biopolymers pass through a fluidic channel. Simultaneous changes in the conductivity at distant electrode pairs may allow for the determination of the length of the biopolymer. The degree of these changes may also allow for the determination of the location of probes on the biopolymer. This determination may be used in the sequencing and identification of biopolymers.

The technology disclosed herein allows the determination of biopolymer length and distances between hybridization positions, independently of the velocity of the biopolymer.

As used herein, a “probe” means any molecule or assembly of molecules capable of sequence-specific covalent or non-covalent binding to a target molecule. A probe may be, but is not limited to, a DNA sequence, an RNA sequence, antibodies or antibody fragments. The terms “nucleotide” and “base” are used interchangeably and mean a molecule consisting of a phosphate group, a sugar and one of five nitrogen-containing bases that can make up DNA or RNA polynucleotide chains or strands. For DNA, the nitrogen-containing bases include cytosine (C), adenine (A), guanine (G) and thymine (T) and the sugar is a 2-deoxyribose. For RNA, the deoxyribose sugar is replaced by a ribose sugar and the thymine bases (T) are replaced by uracil bases (U).

A DNA probe “library” is a collection of DNA probes of a fixed length which includes a large number of, or possibly all, possible sequence permutations. A plurality of probes may be made up of multiple copies of the same probe with the same sequence selectivity or be made up of two or more probes with different sequence selectivity.

A “probe map” means a data set containing information related to the sites along a target sequence at which a probe preferentially binds. The data set may include absolute positional information referenced to a known sequence, relative information related to distances between binding sites, or both.

Error in the length information is the uncertainty of the final length of a biopolymer. This uncertainty may arise from a discrepancy between multiple readings or may be an inherent part of the system based on the placement of sensing electrodes. The error may also result from the behavior of the analyte in the fluidic channel (e.g., a nanochannel). Non-uniform coiling, kinking, bending, and stretching of the analyte may contribute to the error. The error in the length may be determined by statistical analysis.

Error in positional information is the uncertainty of the distance between hybridized molecules. This uncertainty may arise for the same reasons as for the uncertainty in the length information. The error in the distance between probes may be determined by statistical analysis.
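
One possible form of such a statistical analysis is sketched below. The disclosure does not specify a particular statistic; the use of the sample mean, sample standard deviation, and standard error of repeated readings is an assumption of this example, as are the function name and the readings themselves.

```python
import math
import statistics
from typing import Sequence, Tuple


def summarize_readings(readings_nm: Sequence[float]) -> Tuple[float, float, float]:
    """Return (mean, sample standard deviation, standard error) of repeated
    length or probe-distance readings, all in nm."""
    if len(readings_nm) < 2:
        raise ValueError("at least two readings are needed to estimate an error")
    mean = statistics.mean(readings_nm)
    spread = statistics.stdev(readings_nm)       # discrepancy between readings
    std_error = spread / math.sqrt(len(readings_nm))  # uncertainty of the mean
    return mean, spread, std_error


if __name__ == "__main__":
    # Hypothetical repeated measurements of one probe-to-probe distance.
    mean, spread, std_error = summarize_readings([98.0, 100.0, 102.0])
    print(f"{mean:.1f} nm +/- {std_error:.2f} nm (spread {spread:.1f} nm)")
```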

A partially hybridized biomolecule is created when the entire length of a sequence-selective probe binds to a portion of the length of the target biomolecule.

A sensing volume is the volume of electrolyte between two sensing electrodes, through which resistance or voltage is measured by the sensing electrodes. The data set of a probe map may be stored in computer media. Further details of the characteristics of probe and spectrum maps may be found in U.S. Patent Publication No. 2009-0099786 A1, which is incorporated herein by reference in its entirety.

A “target,” i.e., the analyte, is a biopolymer whose length, identity or sequence information is to be determined using embodiments of the present invention. The analyte may be a biopolymer, such as a deoxyribonucleic acid, a ribonucleic acid, a protein, or a polypeptide. The target DNA may be single- or double-stranded. In some embodiments, the analyte is a biopolymer to which probes have been hybridized.

Fabrication of Fluidic Channel and Sensing Electrodes

Various electrical signals may be detected with the sensing electrodes described herein. In some embodiments, the detected electrical signal may be a voltage. Examples of such configurations are described in detail below with respect to FIGS. 1-5. In other embodiments, a current or another electrical signal may be detected (see FIG. 6).

In an embodiment, FIG. 1 shows a device (also referred to herein as a system or apparatus) 100 including a fluidic channel 105, e.g., a micro- or nanochannel, a pair of electromotive electrodes 110, 110′, and a pair of sensing electrodes 115A, 115B. The sensing electrodes 115A, 115B may be in electrical connection with an electrical signal detector 120 such as a voltmeter. The fluidic channel 105 may be defined in a substrate comprising silicon, silicon dioxide, fused silica, and/or gallium arsenide. The fluidic channel may contain an electrolytic solution, with electromotive electrodes 110, 110′ being disposed on a first and a second end of the fluidic channel.

The electromotive electrode 110, 110′ pair may include at least one anode 110′ and cathode 110 in contact with the electrolytic solution to provide a constant or changing current to drive the analyte 125 through the fluidic channel. In an alternate embodiment, a pressure differential, such as a positive pressure, may be used to drive the analyte through the fluidic channel. Pressure may be supplied with a fluid pump or with a pressurized gas line. Other methods of applying pressure may be envisioned by one of skill in the art. In some embodiments, a chemical potential gradient may be used to move molecules through the fluidic channel. Chemical potential gradients may be created with concentration gradients. For instance, a fluidic channel may have one end immersed in a fluid that has a higher salt concentration than the fluid at the other end of the fluidic channel. The differential in salt concentration at the ends of the fluidic channel may cause an osmotic pressure that can drive analytes through the fluidic channel.

As the analyte 125, which may be any biopolymer including, but not limited to, polypeptides, DNA or RNA, passes through the fluidic channel 105 it may pass between the pair of sensing electrodes 115A, 115B (each individually referred to herein as “A” and “B”). The sensing electrodes 115A, 115B contacting the fluidic channel 105 are used to monitor the changes in conductance of the electrolytic volume between them. The changes in conductance between the sensing electrodes 115A, 115B may be measured using an electrical signal detector 120, e.g., a voltmeter.

By making the lateral distance, e.g., along a length of the fluidic channel 105, between sensing electrodes 115A, 115B small, the device 100 retains high sensitivity for an analyte 125 passing therethrough. The sensing electrodes 115A, 115B in the pair may be disposed on opposite sides of the fluidic channel 105 as in FIG. 1, where tips of the sensing electrodes 115A, 115B in contact with the electrolytic solution are entirely or partially across from one another, or as in FIG. 2, in which the tips of the sensing electrodes 115A, 115B are not across from each other, but are rather longitudinally displaced with respect to one another by a selected distance. Alternatively, each sensing electrode 115A, 115B in a pair may cross, or transverse, the fluidic channel 105, as shown in FIG. 3. Referring to FIG. 4, in a third arrangement, two sensing electrodes 115A, 115B in a pair may be on the same side of the fluidic channel 105.

The devices 100 described herein may be nanochannel devices formed by the fabrication of a fluidic channel 105 typically having nanoscale dimensions, and the fabrication of nanoscale electrodes. In some embodiments, the fluidic channel may have microscopic dimensions, e.g., may be a microchannel. A typical device may also have a microscale fluidic structure for introduction of buffers and samples. Thus, the techniques described herein employing nanochannels are also applicable to devices including microchannels. Some or all of the structures may also be sealed with a cap in order to provide closed channels.

Fluidic channels may be formed in the substrate by, e.g., lithographic and etch steps. The substrate may be, e.g., a silicon-on-insulator wafer, with, for example, a (100) Si surface, a Si wafer, a fused silica, or a gallium arsenide substrate. Lithography in the sub-100 nanometer (nm) regime may be performed by various techniques, including the following: electron beam lithography (EBL), nanoimprint lithography (NIL) or deep ultraviolet optical lithography (DUV OL). See Liang, X.; Morton, K. J.; Austin, R. H.; Chou, S. Y., Single sub-20 nm wide, centimeter-long nanofluidic channel fabricated by novel nanoimprint mold fabrication and direct imprinting, Nano Lett. 2007, 7, 3774-3780; Austin, M. D.; Ge, H.; Wu, W.; Li, M.; Yu, Z.; Wasserman, D.; Lyon, S. A.; Chou, S. Y., Fabrication of 5 nm line width and 14 nm pitch features by nanoimprint lithography, App. Phys. Lett. 2004, 84, 5299-5301; and Guo, J., Recent progress in nanoimprint technology and its applications, J. Phys. D: Appl. Phys. 2004, 37, R123-R141, which are incorporated by reference herein in their entirety. The current industry standard in micro- and nanofabrication is optical lithography due to its low cost and high throughput. At present, optical lithography has been successfully used in the mass production of devices with critical dimensions as small as 32 nm. EBL and NIL are presently used extensively in academic research environments due to their versatility and capability of producing sub-10 nm features reproducibly. Any of these methods may be used to pattern the fluidic channels described herein.

The removal of material for the formation of the fluidic channels may be performed by, e.g., etching. Wet etching includes the immersion of the material in a solution capable of selective removal. Dry etching, e.g., reactive ion etching (RIE), involves the exposure of the sample to a charged plasma. For the resolution and control required in nanoscale fabrication, RIE is preferable due to its consistency, controllability, and efficiency. Microfluidic channels or reservoirs leading to the nanoscale channels may be etched using either wet or dry methods.

The resulting fluidic channels have preferred dimensions of width and depth ranging from 1 nm to 5 μm. Each of the width and depth is preferably selected from a range of 1 nm to 10 μm, more preferably 10 nm to 100 nm. The fluidic channels may have a length selected from a range of, e.g., 1 micrometer (μm) to 10 centimeters (cm). After the fluidic channels are formed, sensing electrodes are fabricated. Numerous metal deposition techniques suitable for fabrication of electrodes exist in conventional microfabrication process flows. Each technique has positive and negative attributes and a list of the materials that may be deposited using that technique. The three primary techniques are electron beam evaporation, thermal evaporation, and sputtering. The sensing electrodes have thicknesses ranging from 5 nm to 100 nm at the point where the electrodes intersect the fluidic channels. The electrodes may be wider and/or thicker in regions distal to the fluidic channels and approaching contact pads disposed at the perimeter of the device.

To complete the device, a cap layer may be introduced to prevent evaporation of liquid from the fluidic channel. The cap may be formed over just the nanoscale fluidic paths or over all of the fluidic channels. In the latter case, the cap structure preferably has holes or ports to allow for the introduction of fluid and samples into the fluidic paths. In another embodiment, the entire substrate, i.e., wafer, may be capped. The cap may be made of a glass plate such as borosilicate glass, phosphosilicate glass, quartz, fused silica, fused quartz, a silicon wafer, a gallium arsenide wafer, or other suitable substrates. Various techniques are suitable for accomplishing this step including anodic bonding. In anodic bonding, an underlying silicon wafer and a glass substrate are pressed together and heated while a large electric field is applied across the joint. Anodic bonding has been demonstrated to form a strong bond between a silicon wafer and the capping substrate. Direct silicon bonding has been used to join two silicon wafers. The latter method involves pressing the two wafers together under water. Other methods use an adhesive layer, such as a photoresist, to bond the cap to the substrate.

More particularly, an exemplary fabrication process for defining the fluidic channel and sensing electrodes is as follows. A suitable substrate, such as a conventional (100) p-type silicon wafer, is thermally oxidized in a hydrated atmosphere to grow a thick (e.g., >1 μm) silicon-dioxide (SiO₂) layer. This SiO₂ layer may serve as insulation between subsequently formed adjacent metal electrodes, and may also reduce overall device capacitance.

Using conventional high resolution optical lithography, the pattern of the fluidic channel may be transferred to a first photoresist masking layer. RIE with an anisotropic etch species, such as Cl₂, may be used to transfer the pattern into the SiO₂ layer to define a trench that functions as a fluidic channel in the completed device. The preferred width and depth of the fluidic channel may be determined by the requirements for the device sensitivity. The smaller the volume of the fluidic channel between two electrodes, the more sensitive the device is. Fluidic channel size, width, and depth may also be determined by the size or behavior of the analyte. In one embodiment, the device described herein is used to detect strands of DNA. It may be desirable to fabricate the fluidic channel with dimensions that extend the DNA strand within the channel. For instance, for double-stranded DNA, it has been found that fluidic channels with dimensions of 100 nm or less are able to extend the biopolymer. See Tegenfeldt, J. O. et al. The dynamics of genomic-length DNA molecules in 100-nm channels. Proc. Nat. Acad. Sci. USA, 2004, 101, 10979-10983, which is incorporated by reference herein in its entirety. Upon completion of the dry etch procedure, residual resist is removed and the substrate is vigorously cleaned.

Following the etching of the fluidic channel, embedded metal sensing electrodes are fabricated. Conventional high resolution optical lithography may be used to transfer the metal electrode pattern to a second photoresist masking layer. RIE with an anisotropic etch species, such as Cl₂, may be used to transfer the pattern into the SiO₂ layer. The depth of these trenches may be less than, equal to, or greater than the depth of the fluidic channel. In one embodiment, the depth of these trenches exceeds or equals the depth of the fluidic channel. Upon completion of pattern transfer to the SiO₂ layer, a thin metal adhesion promotion layer may be deposited. A suitable layer is tantalum with a thickness of 30-50 Å, deposited via electron beam evaporation. Next, the electrode material is deposited without exposing the substrate to atmosphere. A preferred metal for the bulk of the electrodes is platinum, also deposited via electron beam evaporation. Other examples of suitable metals include gold, chrome, titanium, silver chloride, silver, and graphene. The thickness of the metal is dictated by the depth of the etched trenches, such that the resultant metal trace is approximately planar with a top surface of the SiO₂ layer. Upon completion of the metal deposition, the substrate is immersed in a photoresist solvent that lifts off excess metal from the surface, and the substrate is vigorously cleaned. Chemical-mechanical polishing (CMP) may be performed to remove excess metal extending over the SiO₂ top surface, thereby planarizing a top surface of the metal to be level with the SiO₂ top surface.

To complete the fabrication of the device, a cap layer is preferably adhered to the device surface to provide a leak-free seal, enabling fluidic conduction. Preferred cap materials include borosilicate glass, fused silica, fused quartz, quartz, or phosphosilicate glass. Holes may be created in the cap layer to provide access to fluidic inlet, fluidic outlet and metal electrodes. A typical method for making holes in glass wafers is ultrasonic etching, which allows for highly controllable pattern transfer to glass substrates. Anodic bonding may then be used to bond the glass cap layer to the underlying substrate, e.g., silicon wafer. The anodic bonding of two layers provides a strong and leak-free seal.

An exemplary device with a pair of such nanoscale sensing electrodes is illustrated in FIG. 1, i.e., electrodes 115A, 115B. Electric current is transferred in the form of ionic flow in an electrolyte solution confined in the fluidic channel (e.g., a nanochannel). The role of the electrolyte is to maintain a uniformly distributed electric field in the fluidic channel. Typical electrolyte solutions have been described in applications of electrophoresis to separations of DNA molecules. The most common electrolytes for electrophoretic separation of DNA are Tris boric acid EDTA (TBE) and Tris acetate EDTA (TAE). See, e.g., Sambrook, J.; Russell, D. W. Molecular Cloning: A Laboratory Manual, 3rd ed. Cold Spring Harbor Press, 2001, which is incorporated by reference herein in its entirety. However, any conductive medium may be used.

Operation of Fluidic Channel

During operation, a current is supplied by applying a potential to a pair of electromotive electrodes 110, 110′, e.g., macroscopic electrodes disposed at opposing ends of the fluidic channel 105 and in contact with the electrolytic solution. The electromotive electrodes 110, 110′ are preferably in electrical communication with wires leading to the ends of the fluidic channels as illustrated in FIGS. 1-6.

The electromotive electrodes 110, 110′ may generate a constant or varying electrophoretic force in the fluidic channel 105 for translocation of an analyte 125 disposed therein. The voltage between the electromotive electrodes 110, 110′ may be constant or it may be changed over the course of a measurement. For instance, it may be desirable to reduce the voltage once a DNA molecule has entered the fluidic channel 105 and before the DNA molecule has entered the volume between the sensing electrodes 115A, 115B, in order to slow the passage of the DNA molecule through the detector volume. Alternatively, a pressure differential and/or a chemical potential gradient may be used to drive the analyte 125 through the fluidic channel 105. Controlling the rate of passage of the DNA molecule through the detector volume allows for more accurate detection, measurement, and analysis of the DNA.

As an example of the placement of sensing electrodes 115A, 115B, a width of 20 nm may be assumed for each of sensing electrodes 115A, 115B in FIG. 1. Electrode 115A may be shifted along the fluidic channel 105 relative to electrode 115B, by, e.g., 10 nm or 30 nm. Distances between sensing electrodes from 30 nm to 100 nm, from 30 nm to 500 nm, or from 30 nm to 5 μm can be incorporated into a single device. For analytes of sufficient length, distances up to, e.g., 500 μm, 300 μm, 200 μm, or 100 μm may be used. Although electrodes with any distance therebetween may be fabricated, DNA is difficult to obtain at a length greater than 500 μm, so any electrode distance greater than 500 μm may be superfluous as long as the length of the DNA does not exceed 500 μm. The smaller displacement (10 nm) between electrodes A and B is an example of an embodiment having overlap of the electrodes, even though they are displaced with respect to one another. In some embodiments, such as FIG. 2, there may be no overlap between sensing electrodes 115A, 115B.
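
The overlap in this placement example may be checked with a short calculation. The sketch below assumes two electrodes of equal width whose footprints along the channel are offset by the stated shift; the function name is hypothetical.

```python
def electrode_overlap_nm(width_nm: float, shift_nm: float) -> float:
    """Longitudinal overlap of two equal-width electrodes offset by shift_nm.

    A result of 0.0 means that the electrodes do not overlap along the channel.
    """
    return max(0.0, width_nm - abs(shift_nm))


if __name__ == "__main__":
    for shift in (10.0, 30.0):
        overlap = electrode_overlap_nm(20.0, shift)
        print(f"shift {shift:.0f} nm: overlap {overlap:.0f} nm")
```

For 20 nm wide electrodes, a 10 nm shift leaves 10 nm of overlap, while a 30 nm shift leaves none.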

The voltage across sensing electrodes 115A, 115B is proportional to the local impedance in the fluidic channel 105 between sensing electrodes 115A, 115B. The spacing of the electrodes is determined by several factors. The smaller the distance between electrodes in a sensing pair, the smaller the detector volume and, all other factors being constant, the smaller the particle that can be detected by the sensing pair. However, fabrication limits may make it difficult to consistently place the electrodes in a pair at small distances. Thus, the selected distance is a trade-off between fabrication reproducibility and sensitivity of the device. The choice of separation distance and thus whether the electrodes are overlapping or non-overlapping depends on these constraints.

The resulting electrode arrangement provides a means to separate the current and voltage probes and may be used to employ 4-point sensing in a fluidic channel. In an embodiment, the electromotive electrodes 110, 110′ at the ends of the fluidic channel 105 provide a current while the nanoscale sensing electrodes disposed across the fluidic channel 105 are used to measure voltage. The sensing electrodes preferably have an output impedance higher than the impedance of the volume being measured.

The following calculations demonstrate the feasibility of this device concept. The fluidic channel 105 may be subjected to a constant electric field equal to the potential difference along the length of the channel divided by the length of the channel, e.g., 100 mV applied longitudinally to a 10 μm long fluidic channel results in a field of 100 mV/10 μm=10 mV/μm or 0.01 mV/nm. The potential difference between electrodes A and B separated by 10 nm is then the product of the distance between electrodes and the electric field, or:

10 nm×0.01 mV/nm=0.1 mV.

Similarly, a potential difference of 0.3 mV exists between electrodes A and B when the spacing is 30 nm. Each of these potentials is readily detectable with conventional electronic measurement tools. When a DNA molecule or any other analyte 125 passes between a pair of electrodes, the impedance between the electrodes changes due to a resistivity difference between the electrolyte and the molecule. The resulting transient change in the potential is measured, while maintaining a constant current.
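By way of illustration only, the feasibility arithmetic above may be sketched in a few lines of Python. The function name and units are assumptions made for this sketch and do not form part of the described device; a uniform field along the channel is assumed, as in the text.

```python
def sensing_potential_mv(applied_mv, channel_length_nm, electrode_spacing_nm):
    """Potential drop (mV) between two sensing electrodes, assuming a
    uniform electric field along the whole fluidic channel."""
    field_mv_per_nm = applied_mv / channel_length_nm
    return field_mv_per_nm * electrode_spacing_nm


# 100 mV applied along a 10 um (10,000 nm) channel, as in the example above.
for spacing_nm in (10, 30):
    drop_mv = sensing_potential_mv(100.0, 10_000.0, spacing_nm)
    print(f"{spacing_nm} nm spacing: {drop_mv:.2f} mV")
```

For the 10 nm and 30 nm spacings this reproduces the 0.1 mV and 0.3 mV values given in the text.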

For the example shown in FIG. 1, assuming a constant velocity, the duration of each voltage pulse is proportional to the length of the DNA or other analyte 125 that passes between the two sensing electrodes.

It is important to note that by shifting one of the transverse electrodes along the fluidic channel 105 by a distance of 10-50 nm, and using a fluidic channel 105 with a diameter of about 10 nm, the volume separating the two sensing electrodes may be viewed as having a sensitivity equivalent to that of a conventional solid-state nanopore.

Referring to FIG. 5, multiple pairs of sensing electrodes are disposed along the fluidic channel 105, with, e.g., a pair of electromotive electrodes 110, 110′, the anode 110′ and cathode 110 each being disposed at one end of the fluidic channel 105, a first pair of sensing electrodes 115A, 115B being disposed in the fluidic channel 105, and a second pair of sensing electrodes 115C, 115D being disposed in the fluidic channel 105 distal to the first pair of sensing electrodes in the direction of the electromotive anode 110′.

In use, the voltage between a pair of electrodes, e.g., V_(AB), or V_(CD) may be sensed by a measurement tool 120, e.g., a voltmeter, configured to measure the potential difference between the electrode pair. In a preferred embodiment, the voltmeter may be in electrical communication with each of the electrodes in the pair via metal contact pads connected to nanowires leading to the electrodes.

Generally, an analyte 125 may be detected in the fluidic channel 105 as follows. The analyte 125 may be introduced into a fluidic channel 105. A potential is applied along the fluidic channel 105 to generate an electrophoretic force therein. For example, a potential may be applied to electromotive electrodes 110, 110′ disposed at each end of the fluidic channel 105, such that an ionic current is created and the analyte 125 is translocated from a first end of the fluidic channel 105 to a second end of the fluidic channel 105. The electromotive electrodes 110, 110′ may generate a constant or oscillating electrophoretic force in the fluidic channel 105 for translocation of the analyte. A voltage between a pair of sensing electrodes 115A-115D disposed in the fluidic channel 105 is measured as the analyte 125 moves past the pair of sensing electrodes. The voltage between the electromotive electrodes 110, 110′ may be constant or it may be changed over the course of a measurement. For example, the voltage may be reduced once a DNA molecule has entered the fluidic channel 105 and before the DNA molecule has entered the volume between the sensing electrodes 115A-115D, to slow the passage of the DNA molecule through the detector volume.

The analyte, e.g., the biopolymer strand and probes, is transferred from a chamber into the fluidic channel in the electrolytic solution. Typically, an electrolyte may be added to the fluidic channel by a pipette, a syringe, or a pump. An analyte sample size may be as small as practically possible, as the device allows the detection of single molecules. The fluid may wet the fluidic channels by capillary action. Analyte may be introduced into the microscale areas either with the original electrolyte or afterward by pumping in a new solution. An analyte, such as DNA, which may be hybridized to one or more probes, may be drawn into the fluidic channel by the potential. For small analytes, one may use diffusion, fluid flow, or a potential.

Referring to FIG. 6, an alternative configuration allows one to sense a current as a biopolymer passes through a fluidic channel. Two pairs of sensing electrodes, e.g., 115A, 115B and 115C, 115D, are disposed along the fluidic channel, with each pair of sensing electrodes 115A, 115B and 115C, 115D defining a detector volume therebetween. The sensing electrodes 115A-115D are connected to ammeters. Changes in current are measured as the analyte 125 passes through the detector volumes, allowing the length of the analyte 125, as well as distances between hybridized probes, to be determined analogously to the methods involving voltage changes, as described above.

Determining the Length of Biopolymers and Probe Separation Using Multiple Sensing Electrode Pairs

In an embodiment, a method for detecting the relative position of probes hybridized to a biopolymer and/or the length of the biopolymer does not rely on the absolute time between detection events of a given electrical signal to determine a distance associated with the biopolymer (e.g., a distance corresponding to a length between probes or the length of the biopolymer itself). Instead, multiple signals are obtained (e.g., as functions of time) corresponding to a plurality of detector volumes at known locations along a fluidic channel through which the biopolymer passes, and the distances are determined by comparing the multiple signals. The positional resolution of the detector may be limited only by the physical limits of fabricating detectors that have electrodes with known positions along the fluidic channel.

When a target biomolecule, such as single-stranded DNA, is incubated with a sequence selective probe under appropriate conditions, the probe hybridizes or binds to the biomolecule at specific sites. The known sequence of the probes, which are complementary to the portion of analyte to which they are bound, together with the determination of the relative location of the hybridization sites, allows for the construction of maps of the target biomolecule and the sequencing of the target molecule.

Nanopores may be used as detectors to determine the distance between hybridization sites as described below and in U.S. Patent Application Publication No. 2007/0190542 A1, which is incorporated herein by reference in its entirety. The construction of a fluidic channel device incorporating voltage detectors is described above. In both the nanopore and the fluidic channel (e.g., a nanochannel), the distance between hybridization sites on the target biopolymer may be inferred from the time between the detection of a first hybridization position and a subsequent hybridization position as the biopolymer moves through the nanopore or fluidic channel.

The accurate determination of distance between hybridization positions in the previously described methods relies on a constant velocity of the biopolymer through the nanopore or fluidic channel during the measurement. The technology disclosed herein allows the determination of biopolymer length and distances between hybridization positions, independently of the velocity of the biopolymer.

Referring to FIG. 7 a, in some embodiments, a device for determining the length of an analyte 125 by detecting an electrical signal may include a fluidic channel 105 defined in a substrate. The substrate may be a suitable rigid material, such as silicon, silicon dioxide, fused silica, or gallium arsenide. The fluidic channel 105, e.g., a nanochannel or a microchannel, may be transversed by a plurality of sensing electrodes 115A-115F disposed along a length of the fluidic channel. The relative positions of the sensing electrodes 115A-115F are known. In the illustrated example, the fluidic channel 105 is transversed by six sensing electrodes 115A-115F. Detector volumes 700, e.g., 700A-700E, are disposed along the fluidic channel, and may be associated with one or more sensing electrodes 115A-115F; in the illustrated example, detector volumes 700A-700E are defined between the sensing electrodes 115A-115F. A sensing element 701 may include two sensing electrodes associated with a given detector volume, e.g., 115A, 115B and detector volume 700A. Electrical signals corresponding to the detector volumes 700 are detected by the plurality of sensing electrodes 115A-115F. An analyte 125, e.g., a biopolymer, is disposed in the fluidic channel. As the biopolymer enters each detector volume along the length of the fluidic channel, a change in the electrical signal is recorded. The length of the biopolymer may be determined from the number of detector volumes that show coincident signals.

The sensing electrodes 115A-115F may be configured for connection to a measurement tool 120 for capturing the electrical signals corresponding to the detector volumes. The measurement tool 120 may be, for example, a voltmeter, an ammeter, or a field-effect transistor. The captured electrical signals indicate the length of the analyte 125. Electrical signals captured by the measurement tool may be recorded as a function of time by a data collection device 702, e.g., a Stanford Research Instruments SIM970 voltmeter. An analyzing module 705, such as a computer, e.g., an apparatus including a memory that stores code defining a set of instructions and a processor that executes the instructions thereby to determine the length of the analyte from two or more detected electrical signals, may be in electrical communication with the data collection device, programmed to determine which detector volumes record a change in electrical signal at the same time. In some embodiments, an electronic circuit may be configured to output a signal only when two electrical signals corresponding to two detector volumes change at the same time.
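As a non-limiting sketch of how an analyzing module might be programmed to determine which detector volumes record a change in electrical signal at the same time, the following Python fragment thresholds each recorded trace against its baseline. The function names, the threshold test, and the sample traces are hypothetical and are provided for illustration only; they are not taken from the described apparatus.

```python
def occupied_volumes(signals, baselines, threshold):
    """For each sample index, return the set of detector volumes whose
    signal deviates from its baseline by more than `threshold`."""
    n_samples = min(len(trace) for trace in signals.values())
    return [
        {name for name, trace in signals.items()
         if abs(trace[i] - baselines[name]) > threshold}
        for i in range(n_samples)
    ]


def coincident_samples(signals, baselines, threshold):
    """Return (sample_index, sorted volume names) wherever two or more
    detector volumes register a change at the same time."""
    occupied = occupied_volumes(signals, baselines, threshold)
    return [(i, sorted(active)) for i, active in enumerate(occupied)
            if len(active) >= 2]


# Made-up voltage traces (mV) for two detector volumes, for illustration.
traces = {"700A": [1.00, 1.10, 1.10, 1.00],
          "700B": [1.00, 1.00, 1.10, 1.10]}
baselines = {"700A": 1.00, "700B": 1.00}
print(coincident_samples(traces, baselines, threshold=0.05))
```

A fixed threshold is the simplest possible detection rule; a practical implementation would likely need noise filtering and per-volume calibration.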

As discussed with reference to FIG. 1, a pair of electromotive electrodes (110, 110′) may be disposed at a first and a second end of a fluidic channel. The pair of electromotive electrodes may include macroscopic electrodes arranged to generate a constant, changing, or oscillating electrophoretic force in the fluidic channel for translocation of the analyte disposed within the fluidic channel.

One or more of the sensing electrodes 115A-115F may be formed from a conductive material, such as platinum, gold, chrome, titanium, silver chloride, silver and/or graphene. As described above with reference to FIGS. 1-6, the sensing electrodes 115A-115F may have various configurations. For example, a sensing element corresponding to a given detector volume may include two sensing electrodes 115A, 115B disposed on opposing sides of the fluidic channel 105 (FIGS. 1 and 2). In another embodiment, the sensing element may include two sensing electrodes 115A, 115B disposed on the same side of the fluidic channel 105 (FIG. 4). In yet another embodiment, the sensing element may include a first sensing electrode 115A transversing the fluidic channel 105 and a second sensing electrode 115B transversing the fluidic channel 105 (FIG. 3).

The fluidic channel may have a width that is not smaller than approximately the same size as the analyte, and may be sufficiently large such that large molecules bound to the analyte may pass through the fluidic channel. For example, the width of the fluidic channel may be selected from a range of 1 nm to 5 μm, preferably 1 nm to 1 μm, more preferably 10 nm to 100 nm. The fluidic channel may be sufficiently deep to allow large molecules bound to the analyte to pass through and yet shallow enough to be approximately the same size as the analyte. The fluidic channel depth may be, e.g., selected from a range of 1 nm to 5 μm, preferably 1 nm to 1 μm, more preferably 10 nm to 100 nm. The length of the fluidic channel may be selected such that the entire analyte is contained in the fluidic channel.

The size of the channel containing multiple sensing volumes may be chosen with regard to the persistence length of the analyte. For example, a randomly coiled polymer (e.g., DNA) may be elongated when introduced into a confined space, such that when the confinement space becomes smaller the extent of elongation becomes greater. In some embodiments, it may be preferable to elongate the analyte to measure length or distance between probes. Depending on the cross-sectional size and the persistence length, it may be useful to have the geometric mean of the width and depth of the channel be between 5% and 500% of the persistence length of the analyte. For example, for double-stranded DNA, under conditions where the persistence length is 50 nm, it may be preferable to have, e.g., a fluidic channel with a width and depth between 2.5 nm and 250 nm. In other embodiments, for more rigid polymers such as RecA coated DNA, under conditions where the persistence length is 950 nm, it may be preferable to have, e.g., a fluidic channel with a width and depth between 47.5 nm and 4.75 μm.
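The sizing guideline above may be illustrated with the following Python sketch, which computes the 5% to 500% window for a given persistence length and checks whether the geometric mean of a candidate width and depth falls within it. The function names are hypothetical and are used here only for illustration.

```python
import math


def channel_size_window_nm(persistence_length_nm, low=0.05, high=5.0):
    """Lower and upper limits (nm) corresponding to 5% and 500% of the
    persistence length of the analyte."""
    return low * persistence_length_nm, high * persistence_length_nm


def geometric_mean_in_window(width_nm, depth_nm, persistence_length_nm):
    """True if sqrt(width * depth) lies within the 5%-500% window."""
    lower, upper = channel_size_window_nm(persistence_length_nm)
    return lower <= math.sqrt(width_nm * depth_nm) <= upper


# Double-stranded DNA under conditions where the persistence length is 50 nm.
print(channel_size_window_nm(50.0))
print(geometric_mean_in_window(100.0, 100.0, 50.0))
```

For a 50 nm persistence length the window evaluates to 2.5 nm to 250 nm, matching the double-stranded DNA example in the text.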

While not wishing to be bound by theory, an analyte may be pulled into the fluidic channel (e.g., by electromotive force), and then extended (e.g., linearized) as the electromotive force pulling the analyte into the fluidic channel is counterbalanced by the mass of the analyte outside of the fluidic channel that is being unwound. In some embodiments, the structure of the fluidic channel may facilitate entry of the analyte into the channel, e.g., the fluidic channel may comprise a series of posts (e.g., U.S. Pat. No. 7,217,562, which is incorporated by reference in its entirety) and/or a funnel shape.

In an embodiment, a fluidic channel detector may be preferably arranged such that the entire analyte enters the fluidic channel before it enters the first detector volume. This configuration provides the advantage of reducing the effect of the analyte on the conductance of the fluidic channel. For instance, if one is beginning to measure the change in potential of a detector volume while the conductance of the whole fluidic channel is changing due to more analyte entering the fluidic channel, the analysis becomes more complicated. Similarly, preferably, the analyte is contained completely in the fluidic channel when it exits the last detector volume. Thus, the fluidic channel preferably has a minimum length that is approximately three times the length of the analyte (assuming that the detector volume is only as long as the analyte, which is a minimal requirement but not optimal). The length of a 1 kb piece of DNA is about 330 nm, so the fluidic channel is preferably at least 1 μm in length. The longest piece of DNA suitable for analysis with the described methods may be 10 megabases (Mb), which corresponds to a preferred fluidic channel length of at least 10 mm. More preferably, the length of a fluidic channel is ten times the length of the analyte, and thus a more preferred upper limit for a fluidic channel length is 100 mm (10 cm). Thus, the fluidic channel length is preferably selected from a range from 1 μm to 10 cm. Longer and shorter fluidic channel lengths are also possible.
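The minimum channel length estimate above may be reproduced with a short Python sketch. The constant of 0.33 nm per base is inferred from the statement that a 1 kb piece of DNA is about 330 nm long; the function names and the default multiple of three are assumptions made for illustration.

```python
NM_PER_BASE = 0.33  # inferred from "1 kb of DNA is about 330 nm"


def analyte_length_nm(n_bases, nm_per_base=NM_PER_BASE):
    """Approximate length (nm) of a DNA analyte of n_bases."""
    return n_bases * nm_per_base


def min_channel_length_nm(n_bases, multiple=3.0):
    """Minimum channel length as a multiple of the analyte length
    (three times, per the minimal requirement stated above)."""
    return multiple * analyte_length_nm(n_bases)


print(min_channel_length_nm(1_000) / 1e3, "um for a 1 kb analyte")
print(min_channel_length_nm(10_000_000) / 1e6, "mm for a 10 Mb analyte")
```

These evaluate to roughly 1 μm and roughly 10 mm, respectively, in line with the values stated in the text.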

A plurality of fluidic channels may be defined in the substrate, allowing parallel or sequential analysis of various analytes. A voltage amplifier may be incorporated into the device, configured to amplify two or more electrical signals.

Referring to FIG. 7 b, the electrical signal increases linearly as the analyte 125 enters the detector volume 700A. When the detector volume is completely filled, the signal stays essentially constant. The electrical signal may have fluctuations from noise and from the fact that a long analyte 125 such as DNA typically has small bends in it. When a section having a bend enters the detector volume, the electrical signal may increase slightly, and then may decrease slightly when the bend exits the detector volume. Calibration of the electrical signals, therefore, may be needed to determine length. If one measures just the distance between the ends of the analyte 125, one may not determine the actual length of the analyte 125 because of small kinks and bends. By calibrating the electrical signal by measuring the electrical signal generated by an analyte 125 of a known length, one may determine the actual length between ends.

The length of an analyte may be determined in two ways. In a first method, the length is determined by the distance between two sensing electrodes that detect the analyte at the same time. Sensing electrodes that are displaced with respect to one another by a distance that is less than the length of the analyte detect the analyte at the same time. Sensing electrodes that are positioned farther apart than the length of the analyte do not detect the analyte at the same time. Any of the sensing electrode configurations described herein may be used in this implementation. Sensing electrodes displaced along a length of a fluidic channel, as well as sensing electrodes disposed opposite each other in a fluidic channel may both be used.

In an embodiment, sensing electrodes 115A, 115B, 115C, 115D, 115E, 115F (not shown) may be respectively disposed sequentially along a length of a fluidic channel 105, defining three detector volumes 700A, 700C, 700E between sensing electrodes 115A and 115B, 115C and 115D, and 115E and 115F. If the analyte 125 fills two detector volumes (e.g., between sensing electrodes 115A and 115B and sensing electrodes 115C and 115D) at the same time, the analyte 125 is as long as or longer than the distance between the two outer sensing electrodes 115A, 115D disposed at the outer bounds of the two detector volumes. If the analyte 125 is not detected at the same time in a third detector volume 700E, bounded by sensing electrodes 115E, 115F, which are separated by a greater distance from sensing electrodes 115A, 115B than are sensing electrodes 115C, 115D, then the analyte 125 is shorter than the distance between sensing electrodes 115B and 115E. Thus, the length of the analyte 125 is at least the distance between sensing electrodes 115A and 115D and shorter than the distance between sensing electrodes 115B and 115E. The difference between the distance 115A-115D and 115B-115E may define the resolution of the system, i.e., how accurately one can determine the length of the analyte 125.

Referring again to FIG. 7 a, the resolution may be improved by also utilizing the time domain. That is, if the analyte 125 is detected at the same time in detector volumes 700B, 700C between sensing electrodes 115B-115C and sensing electrodes 115C-115D, but the analyte 125 is not detected in a detector volume 700D between sensing electrodes 115D-115E, one may measure the time between the loss of signal in detector volume 700B and the acquisition of signal in detector volume 700D. If the speed of the analyte 125 in the fluidic channel 105 is determined, then the measured time may be used to calculate the distance the analyte 125 moved (velocity×time=distance) before it reached detector volume 700D between sensing electrodes 115D, 115E and thus how much shorter the analyte 125 is than the distance between sensing electrodes 115B and 115D.
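The time-domain refinement described above relies on the relation velocity×time=distance. A minimal Python sketch follows; the function names are hypothetical, and the reference distance is left as a parameter so that the sketch does not presuppose which electrode pair the shortfall is measured against.

```python
def distance_travelled_nm(velocity_nm_per_s, elapsed_s):
    """Distance the analyte moved during the measured time interval."""
    return velocity_nm_per_s * elapsed_s


def refined_length_nm(reference_distance_nm, velocity_nm_per_s, elapsed_s):
    """Analyte length estimated as the reference electrode distance minus
    the distance travelled between loss of signal in one detector volume
    and acquisition of signal in a later detector volume."""
    shortfall_nm = distance_travelled_nm(velocity_nm_per_s, elapsed_s)
    if shortfall_nm > reference_distance_nm:
        raise ValueError("shortfall exceeds the reference distance")
    return reference_distance_nm - shortfall_nm


# Illustrative values only: 200 nm reference distance, 1000 nm/s, 50 ms gap.
print(refined_length_nm(200.0, 1000.0, 0.05))
```

This refinement, unlike the purely distance-based determination, depends on the speed of the analyte being known over the measured interval.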

The second method is preferably used with detector volumes defined by displaced electrodes, as it relies on using the distance between electrodes rather than a distance between detector volumes. A detector volume defined by sensing electrodes separated by a distance that is less than the length of the analyte is completely filled by the analyte as the analyte moves through the fluidic channel, and provides an electrical signal that is the maximum for that detector volume/analyte combination. Thus, if two sensing electrodes 115A, 115B are separated by a known distance and the analyte completely fills the detector volume therebetween, then the analyte is at least as long as the separation between the two sensing electrodes. A detector volume bounded by sensing electrodes 115C, 115D that are separated by a distance greater than the length of the analyte provides an electrical signal that is less than maximal. The ratio of the electrical signal (e.g., observed voltage change) to the maximum expected signal (e.g., maximum expected voltage change) of a detector volume is equal to the ratio of the analyte length to the distance between sensing electrodes bounding the detector volume:

l=d×(ΔV_(obs)/ΔV_(max))

where l is the analyte length, d is the distance between the sensing electrodes, ΔV_(max) is the maximum expected voltage change of a detector volume, and ΔV_(obs) is the observed voltage change. This method can be used to determine the length of the biopolymer when the biopolymer is shorter than the distance between sensing electrodes. It may also be used to increase the resolution of the determination of the biopolymer length when the biopolymer does not fit perfectly between sensing electrodes.
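The ratio relation above may be sketched in Python as follows. The function name and the returned flag are assumptions introduced for illustration; when the observed change reaches the maximum expected change, the detector volume is treated as completely filled, so only a lower bound equal to the electrode distance can be reported.

```python
def analyte_length_from_ratio(electrode_distance_nm, dv_observed, dv_max):
    """Return (length_nm, is_lower_bound) using
    length = distance * (dv_observed / dv_max)."""
    if dv_max <= 0:
        raise ValueError("dv_max must be positive")
    ratio = dv_observed / dv_max
    if ratio >= 1.0:
        # Detector volume completely filled: analyte is at least this long.
        return electrode_distance_nm, True
    return electrode_distance_nm * ratio, False


# Illustrative values only: 100 nm electrode spacing, 40% of maximal signal.
print(analyte_length_from_ratio(100.0, 0.04, 0.10))
```

With these illustrative inputs the estimate is approximately 40 nm, i.e., 40% of the electrode spacing.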

Calibration to determine the maximal electrical signal may be performed with a similar analyte that is longer than the distance between sensing electrodes being calibrated. A calibration factor may be calculated from the maximal signal obtained in other similar detector volumes that may have different separations between sensing electrodes, or it may be calculated theoretically from the known behavior of the analyte in the test set up and the known distance between the sensing electrodes. For example, if the electrical signal used is a voltage signal reflecting a potential drop across displaced sensing electrodes, the percent change in voltage signal in any detector volume for a maximal signal for a species of analyte is a constant. That is, if the voltage signal between sensing electrodes 115A, 115B changes by a maximum of 10% when a piece of double-stranded DNA that is much longer than the distance between sensing electrodes 115A, 115B goes through the detector volume, then the maximum change in potential in any other detector volume in the same fluidic channel, in which the channel width and depth remain essentially constant, may also be 10%.
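The calibration described above may be illustrated by the following Python sketch, which derives a fractional maximal change from one detector volume and applies it to another detector volume in the same fluidic channel. The function names and numerical inputs are hypothetical; the sketch assumes, as stated in the text, that the percent change for a maximal signal is constant for a given species of analyte when the channel width and depth remain essentially constant.

```python
def calibrate_max_fraction(baseline_mv, observed_max_change_mv):
    """Fractional maximal change measured with an analyte much longer
    than the electrode separation of the calibrated detector volume."""
    return observed_max_change_mv / baseline_mv


def expected_max_change_mv(baseline_mv, max_fraction):
    """Maximum expected voltage change in another detector volume,
    assuming the same fractional change applies."""
    return baseline_mv * max_fraction


# Calibrate on a volume with a 0.1 mV baseline and a 0.01 mV maximal change,
# then predict the maximal change for a volume with a 0.3 mV baseline.
fraction = calibrate_max_fraction(0.1, 0.01)
print(fraction, expected_max_change_mv(0.3, fraction))
```

The predicted value may then serve as ΔV_(max) when the observed change in the second detector volume is less than maximal.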

In some embodiments, the sensing electrodes have thicknesses ranging from 5 nm to 100 nm at the point where the sensing electrodes intersect the fluidic channels. The sensing electrodes may be wider and/or thicker in regions distal to the fluidic channels and approaching contact pads disposed at the perimeter of the device. As an example, a width of 20 nm may be assumed for each of the sensing electrodes. The sensing electrodes may be disposed along the fluidic channel at regular intervals with respect to one another, by, e.g., 10 nm or 30 nm.

In particular, in an embodiment, a device utilizes multiple pairs of nanoscale sensing electrodes for electrical sensing of biomolecules and other nanoscale analytes in fluidic channels. The analytes may include a biopolymer such as deoxyribonucleic acids, ribonucleic acids, and polypeptides. The biopolymer may be a single-stranded molecule. Portions of the analyte sequence may be at least partially hybridized with a probe, and the detected electrical signals may indicate the presence of a probe bound to the analyte. The sensing electrodes disposed along the fluidic channel may be used to determine the length of the analyte. In use, the analyte may be disposed in the fluidic channel. A potential may be applied along the fluidic channel to generate an electrophoretic force therein, such that the analyte is translocated from one end of the fluidic channel to another end of the fluidic channel. As the analyte moves down the fluidic channel, it enters the detector volumes disposed along the fluidic channel. The analyte may occupy more than one detector volume at the same time. Each occupied detector volume emits an electrical signal that indicates the presence of the analyte in the detector volume. The electrical signal may be a voltage signal, a current signal, or another electrical signal. As the analyte moves through the fluidic channel, the sensing electrodes may detect two or more electrical signals corresponding to two or more detector volumes.

The electrical signal(s) may be recorded, e.g., by a measurement tool in electrical communication with the sensing electrodes. The length of the analyte may be determined by analyzing the detected electrical signals. For instance, one may identify at least two detector volumes in which the analyte is sensed at a given time, and then determine a distance between sensing electrodes corresponding to the at least two detector volumes. Thus, the furthest spaced detector volumes that have coincident signals may provide a lower bound for the length of the analyte in the fluidic channel while the closest spaced detector volumes that simultaneously lack a signal may provide an upper bound for the length of the analyte in the fluidic channel. The distance between the outer ends of the furthest spaced detector volumes that have coincident signals provides an upper bound on the length of the analyte in the fluidic channel. The distance between the inner ends of the furthest spaced detector volumes that have a coincident signal provides a lower bound on the length of the analyte in the fluidic channel. The error in the determination of the length of the analyte depends on the accuracy to which the distances between sensing electrodes are known.
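The bounding procedure described above may be sketched in Python as follows. The sketch assumes, for illustration only, that detector volume k is the region between consecutive sensing electrodes k and k+1, that electrode positions are given in nanometers along the channel, and that the analyte is sensed only in the listed detector volumes; the function name and the example positions are hypothetical.

```python
def length_bounds_nm(electrode_positions_nm, occupied_volumes):
    """Return (lower, upper) bounds on the analyte length from the furthest
    spaced detector volumes that show coincident signals.

    Volume k spans electrode_positions_nm[k] to electrode_positions_nm[k + 1].
    """
    if len(occupied_volumes) < 2:
        raise ValueError("need at least two coincident detector volumes")
    first, last = min(occupied_volumes), max(occupied_volumes)
    pos = electrode_positions_nm
    lower = pos[last] - pos[first + 1]   # distance between inner ends
    upper = pos[last + 1] - pos[first]   # distance between outer ends
    return lower, upper


# Six equally spaced electrodes (illustrative 100 nm pitch); analyte sensed
# simultaneously in the first three detector volumes.
positions = [0.0, 100.0, 200.0, 300.0, 400.0, 500.0]
print(length_bounds_nm(positions, {0, 1, 2}))
```

With the illustrative positions the bounds are 100 nm and 300 nm; the gap between them reflects the electrode spacing and can be narrowed by closer or variably spaced electrodes.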

A correction factor may be applied to the measured length to determine the actual length of the analyte. For instance, it is known that a polymer that is confined to nanoscale channels stretches to a varying percentage of its full length depending on the size of the fluidic channel and the persistence length of the polymer under the conditions of the experiment (Tegenfeldt, J. O. et al. Proc. Nat. Acad. Sci. 2004, 101, 10979) which is incorporated herein by reference in its entirety. Analytes that are longer than their persistence length (i.e., longer than the maximum length of the uninterrupted polymer chain persisting in a particular direction) tend to form random coils or balled-up structures in solution. When a biopolymer is confined to a fluidic channel, it may be forced to extend and become more linear. However, depending on the size of the fluidic channel and the persistence length of the biopolymer, it may not be straightened out to its full length. Thus, any measurement of the length of the biopolymer in a fluidic channel is preferably corrected either by using a calibration standard in the same detector volume or by applying a correction factor based on the behavior of the biopolymer in the fluidic channel.
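The use of a calibration standard to correct a measured length may be illustrated with the following Python sketch. The function names and numerical values are hypothetical; the sketch assumes that the standard and the sample stretch to the same fraction of their full length in the same fluidic channel under the same conditions.

```python
def correction_factor(known_length_nm, measured_length_nm):
    """Ratio of the actual length of a calibration standard to the length
    measured for that standard in the fluidic channel."""
    if measured_length_nm <= 0:
        raise ValueError("measured length must be positive")
    return known_length_nm / measured_length_nm


def corrected_length_nm(measured_length_nm, factor):
    """Apply the correction factor to a measured analyte length."""
    return measured_length_nm * factor


# Illustrative values: a 1000 nm standard that measures 800 nm in the channel.
factor = correction_factor(1000.0, 800.0)
print(factor, corrected_length_nm(400.0, factor))
```

In this example a sample measuring 400 nm in the channel would be assigned a corrected length of 500 nm.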

In another embodiment, multiple sensing electrodes may be used to determine the distance between labeled positions on a biopolymer by observing electrical signals indicating labels that are coincident in two or more sensing elements. The biopolymer may be DNA or RNA. The RNA or DNA may be single-stranded or double-stranded. The label may be a protein that is bound to the biopolymer or it may be an oligonucleotide that is hybridized to the biopolymer. The label may also be any molecule that binds to the biopolymer. The binding may be sequence dependent, or it may be conformation dependent. FIG. 7 c illustrates the slope of the electrical signal that reflects first the entry of the biopolymer into a detector volume 700A, and then the entry of a hybridized probe 710 into the same detector volume 700A.

The distance between sensing electrodes allows one to determine the distance between the labeled positions. The determination of the distance between labeled positions relies on knowledge of the distance between the sensing electrodes and does not require that the biopolymer move with constant velocity through the device. The spacing between sensing electrodes may be chosen to vary in a fashion that enables the determination of a variety of distances. The distances between hybridization events may be used to map, haplotype, or sequence biopolymers such as DNA or RNA.

The determination of distances between hybridization events is illustrated in FIGS. 8 a-i, where a fluidic channel 105 is transversed by 6 sensing electrodes 115A-115F, (e.g., a distance-based detector). FIGS. 8 a-8 i illustrate the progression of a hybridized biomolecule through a fluidic channel, and the resulting electrical signals. A biopolymer with hybridized probes 710 is disposed in the fluidic channel. In FIG. 8 d, two sensing elements (each including two sensing electrodes 115 and the detector volume 700 therebetween) indicate the presence of hybridized probes 710 at the same time. A distance between probes 710 may then be determined by using a response of electrical signals corresponding to two or more detector volumes. Correlation of signals from different detector volumes gives the distance between probes. A spacing between sensing electrodes 115A-115F may be used to determine a maximum distance or a minimum distance between probes 710. In the illustrated embodiments, the distance between these probes 710 is less than the distance between sensing electrodes 115A, 115C and greater than the distance between sensing electrodes 115B, 115C.

The calculation of distances between probes 710 may be used to determine the sequence of a biopolymer as follows. An analyte 125 may be prepared by hybridizing a first plurality of probes 710 with the biopolymer such that the first plurality of probes 710 attaches to portions of the biomolecule to produce a partially hybridized biomolecule. A plurality of probes 710 may be made up of multiple copies of the same probe 710 with the same sequence selectivity or may be made up of two or more probes 710 with different sequence selectivity. A partially hybridized biomolecule is created when the entire length of a sequence selective probe 710 binds to a portion of the length of the target biomolecule. The probes 710, i.e., hybridizing oligonucleotides, may each have the same composition, or may have different compositions. Further details regarding hybridization are given below.

The analyte 125 may be disposed in a fluidic channel 105, e.g., a nanochannel or a microchannel. A potential may be applied along the fluidic channel 105 to generate an electrophoretic force therein such that the analyte 125 is translocated from a first end of the fluidic channel 105 to a second end of the fluidic channel. Two or more electrical signals may be detected as the analyte 125 moves through the fluidic channel. The detected electrical signals, corresponding to two or more detector volumes of the fluidic channel, may be detected by using a plurality of sensing electrodes 115A-115F disposed along the length of the fluidic channel. The detected electrical signals may indicate the locations of the hybridized probes 710 along the biopolymer. The electrical signals may be analyzed to determine in which detector volumes probes 710 bound to the biomolecule are located. This analysis may be done either visually or with the assistance of a computer program that executes the analysis described herein. At least a portion of the sequence of the biopolymer may be determined by using a distance between the sensing electrodes 115A-115F corresponding to the detector volumes in which the probes 710 are located and the known recognition site sequence of the probe 710. At each point where the probe has bound, the sequence may be known because it is complementary to the bound probe. The combination of the known recognition site sequence and its distance from the end of the biopolymer allows for the determination of the biopolymer sequence. A computer algorithm may be used to process the two or more electrical signals to help determine the sequence of the biopolymer.

Additional details regarding the sequencing of a biopolymer based on positional information obtained by use of nanopores are provided below in the section Sequencing by Using Probes (fluidic channels are elongated nanopores) as well as in U.S. Patent Publication No. 2007/0190542 A1, which is incorporated herein by reference in its entirety. Further details regarding reconstructing positional data are disclosed in U.S. Publication No. 2009/0099786, which is incorporated herein by reference in its entirety; the publication discusses positional data primarily within the context of sequencing double-stranded DNA, but the principles are applicable to the sequencing of single-stranded DNA.

As shown in FIG. 7 c, the electrical signal may initially change when the biopolymer moves through a detector volume 700A associated with two sensing electrodes 115A, 115B and further change when a portion of the biopolymer including a hybridized probe 710 moves through the detector volume 700A.

In some embodiments, a second plurality of probes may be hybridized with the biopolymer, and the detecting, analyzing, and determining steps may be repeated with the second plurality of probes. This subsequent hybridization may occur with the same biopolymer sample that was exposed to the first plurality of probes, or it may occur using a naïve, previously unhybridized sample.

The electrical signals may be used to detect and record complexed and uncomplexed regions of the biopolymer to create a first probe map of the first plurality of probes and a second probe map of the second plurality of probes, the first probe map and the second probe map each respectively including information about the relative positions of the hybridized first and second plurality of probes. Each probe map may include a series of numbers that indicate the distances between probes. The numbers may indicate distance in terms of base pairs or distance in terms of nanometers. A candidate sequence for at least a portion of the biopolymer may be determined by ordering at least two probe sequences using positional information and/or a combination of overlapping probe binding sequences and positional information.

The biopolymer may include a double-stranded biopolymer target molecule.

The first and second probe maps may include information about an error of the positional information for each probe. For example, each indicated distance may have an associated standard deviation, e.g., 100 nm±10 nm. Further, a candidate sequence may be determined by ordering at least two probe sequences using at least one of (i) positional information and parameters relating to the error in positional information or (ii) a combination of overlapping sequences of the probe molecules and positional information and error in positional information.

The analyte may be prepared by contacting the biopolymer, i.e., the target molecule, with a first probe having a first probe specificity for recognition sites of the target molecule to form a first plurality of local ternary complexes, the first probe having a first known recognition site sequence. The electrical signal may be used to determine positional information of the first plurality of local ternary complexes.

The biopolymer may also be contacted with a second probe, either subsequently or in parallel to the first probe, having a second probe specificity for recognition sites of the target molecule to form a second plurality of local ternary complexes. The second probe may have a second known recognition site sequence. Positional information of at least the first and second plurality of local ternary complexes may be aligned to determine a sequence of the biopolymer.

Referring to FIGS. 9 a and 9 b, a fluidic channel 105 is traversed by numerous perpendicular sensing electrodes 115. The sensing electrodes 115 may be grouped into clusters of finely spaced electrodes, with greater separation between clusters. Finely spaced electrodes give distance information over short distances. Electrodes spaced further apart provide correlation signals over longer distances.

The sensing electrode 115 arrangement of FIG. 9 b allows the measurement of biopolymer lengths with an accuracy of about 100 bp. Larger spacing of the sensing electrodes 115 allows longer lengths to be measured. By combining regions of short spacing with regions of long spacing, measurement of long lengths with high resolution may be obtained. As illustrated, in an embodiment, sensing electrodes 115 are arranged to allow for accurate measurement of longer lengths in small length increments. That is, the sensing electrode 115 spacing enables high resolution even for longer lengths. For instance, if the analyte is DNA, the small increments may have sensing electrode 115 separations of about 100 bases, and the groups of tightly spaced sensing electrodes 115 may be separated by 1000 bases to 10,000 bases or more.

Sensing electrode spacing may be optimized with respect to the number of different length measurements that may be made with a defined number of detector volumes or a defined number of sensing electrodes. The spacing may be based on a Golomb ruler. Referring to FIG. 9 c, a Golomb ruler (e.g., of length 6 as shown in FIG. 9 c) in this context would define a set of integer distances between sensing electrodes such that no two distances are the same. For instance, sensing electrodes at positions 0, X, 4X, 6X would give lengths of X, 2X, 3X, 4X, 5X, and 6X.

A Golomb ruler may also be used to set the spacing between sensing electrodes for each detector volume. Thus, the spacing between the sensing electrodes defining the first detector volume may have a distance X. The next sensing electrode may be placed at a distance of 3X from the second sensing electrode. The next sensing electrode may be placed at a distance 2X from the third electrode. Detector volumes having lengths X, 2X, 3X, 4X, 5X, and 6X may then be obtained from 4 electrodes.

Golomb rulers may serve as a starting point from which other electrodes may be inserted in order to sample all lengths or have all detector volumes represented. As a result, one or more of the lengths or spacings may be repeated. For instance, the Golomb ruler: 0, 3X, 4X, 9X, 11X can measure lengths X, 2X, 3X, 4X, 5X, 6X, 7X, 8X, 9X, 11X. The distance 10X is missing. By modifying the ruler to be 0, 3X, 4X, 9X, 10X, 11X the possible measured lengths are X, X, X, 2X, 3X, 4X, 5X, 6X, 6X, 7X, 7X, 8X, 9X, 10X, 11X. Thus, the X, 6X, and 7X spacings have more than a single occurrence. However, six pairs of sensing electrodes are capable of measuring 11 integer increments of distance.
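The spacing argument above may be checked with a short sketch that enumerates every pairwise distance available from a set of sensing electrode positions. The function names `measurable_lengths` and `missing_lengths`, and the representation of positions as integers in units of X, are assumptions made for illustration only and are not part of the described device.

```python
from collections import Counter
from itertools import combinations


def measurable_lengths(positions):
    """Count how often each pairwise distance occurs.

    `positions` are hypothetical electrode locations in units of X.
    """
    return Counter(abs(b - a) for a, b in combinations(positions, 2))


def missing_lengths(positions):
    """Integer lengths up to the full span that no electrode pair measures."""
    lengths = measurable_lengths(positions)
    span = max(positions) - min(positions)
    return [d for d in range(1, span + 1) if d not in lengths]


if __name__ == "__main__":
    for ruler in ([0, 1, 4, 6], [0, 3, 4, 9, 11], [0, 3, 4, 9, 10, 11]):
        counts = measurable_lengths(ruler)
        print(ruler, sorted(counts.elements()), "missing:", missing_lengths(ruler))
```

For the ruler 0, 3X, 4X, 9X, 11X the sketch reports 10X as the only missing length, and adding an electrode at 10X removes the gap at the cost of repeating the X, 6X, and 7X spacings.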

Other spacings, integral or non-integral, may be envisioned by one of skill in the art.

Referring to FIG. 10, in an alternative arrangement, sensing electrodes 115A-115F traverse a fluidic channel 105 with varying distances between sensing electrodes 115A-115F. The distances between sensing electrodes 115A-115F may be selected in accordance with the principles discussed with reference to FIGS. 9 a-9 c. As illustrated, the device may include at least three pairs of sensing electrodes 115A, 115B; 115C, 115D; 115E, 115F. A distance between detector volumes 700A, 700C defined by a first and second pair of sensing electrodes 115A, 115B; 115C, 115D may be equal to a distance between sensing electrodes 115B, 115C, and a distance between detector volumes 700C, 700E defined by the second and third pair of sensing electrodes 115C, 115D; 115E, 115F may be equal to a distance between sensing electrodes 115D, 115E. Thus, the distance between detector volumes 700A, 700C defined by the first and second pairs of sensing electrodes 115A, 115B; 115C, 115D may be unequal to a distance between detector volumes 700C, 700E defined by the second and third pairs of sensing electrodes 115C, 115D; 115E, 115F. In some embodiments, the detector volume 700A defined by the first pair of sensing electrodes 115A, 115B is unequal to a detector volume 700C defined by a second pair of sensing electrodes 115C, 115D, and a detector volume 700C defined by the second pair of sensing electrodes 115C, 115D is unequal to a detector volume 700E defined by a third pair of sensing electrodes 115E, 115F.

The arrangement of FIG. 10 is shown in use in FIGS. 11 a-11 g, where a biopolymer with hybridized probes 710 is disposed in a fluidic channel. The detector volume 700E between sensing electrodes 115E-115F contains two probes 710″, 710′″. The distance between the two probes 710″, 710′″, therefore, is less than the distance between sensing electrode 115E and sensing electrode 115F. Two peaks for a signal from the same detector volume, e.g., 700E, indicate that the distance between two sensing electrodes 115E, 115F associated with the detector volume 700E is an upper limit of the distance separating two detected probes. FIGS. 11 b-11 g illustrate the translocation of the biopolymer with hybridized probes 710 through the fluidic channel; the electrical signals reflect the positions of the biopolymer and probes 710 in the detector volumes 700 between sensing electrodes 115A-115F having variable distances. Referring to FIG. 11 e, coincident signals from two different detector volumes 700A, 700C are indicative of a distance between the probes.

Sequencing by Using Probes

Sequencing a biomolecule such as DNA using one or more probes may be performed as follows, in combination with the length/distance measurement techniques disclosed herein.

Referring to FIG. 12, a DNA molecule 1200, i.e., a type of analyte 125, is schematically depicted and is structured in two strands 1205, 1210 positioned in anti-parallel relation to one another. Each of the two opposing strands 1205, 1210 may be sequentially formed from repeating groups of nucleotides 1215 where each nucleotide 1215 consists of a phosphate group, 2-deoxyribose sugar and one of four nitrogen-containing bases. The nitrogen-containing bases include cytosine (C), adenine (A), guanine (G) and thymine (T). DNA strands 1205 are read in a particular direction, from the so-called 5′ or “five prime” end to the so-called 3′ or “three prime” end. Similarly, RNA molecules 1300, as schematically depicted in FIG. 13, are polynucleotide chains, which differ from those of DNA 1200 by having ribose sugar instead of deoxyribose and uracil bases (U) instead of thymine bases (T).

Traditionally, in determining the particular arrangement of the bases 1215 and thereby the sequences of the molecules, a process called hybridization may be utilized. The hybridization process is the coming together, or binding, of two genetic sequences with one another. This process is predictable because the bases 1215 in the molecules do not share an equal affinity for one another. T (or U) bases favor binding with A bases while C bases favor binding with G bases. Sequence selective binding is mediated via hydrogen bonds that exist between the opposing base pairs. For example, A and T (or U) form two hydrogen bonds using two hydrogen bond acceptors and donors that are lined up with respect to each other in the duplex. Similarly, the nucleosides C and G bind to one another via three hydrogen bonds formed by hydrogen bond acceptors and donors on the bases.

A hybridizing oligonucleotide, i.e., a probe 710, may be used to determine and identify the sequence of bases 1215 in the molecule of interest. FIG. 14 illustrates a probe 710 that is a short DNA sequence having a known composition. Probes 710 may be of any length depending on the number of bases 1215 that they include. For example, a probe 710 that includes six bases 1215 is referred to as a six-mer wherein each of the six bases 1215 in the probe 710 may be any one of the known four natural base types A, T(U), C or G and alternately may include non-natural bases.

In this regard, the total number of unique probes 710 in a library depends upon the number of bases 1215 contained within each probe 710 and the number of different types of bases in the probes. If only the four natural bases are used in probe 710, the total number of probes in the library is determined by the formula 4^(n) (four raised to the n power) where n is equal to the total number of bases 1215 in each probe 710. Formulas for other arrangements or types of bases are well known in the art. Accordingly, the size of the probe library can be expressed as 4^(n) n-mer probes 710. For the purpose of illustration, in the context of a six-mer probe, the total number of possible unique, identifiable probe combinations includes 4⁶ (four raised to the sixth power) or 4096 unique six-mer probes 710. The inclusion of non-natural bases allows for the creation of probes that have spaces or wildcards therein in a manner that expands the versatility of the library's range of probe recognition. Probes that include universal bases organized into patterns with natural bases may also be used, for example those described in U.S. Pat. Nos. 7,071,324, 7,034,143, and 6,689,563, which are incorporated herein by reference in their entireties.
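The library-size formula may be illustrated with a brief sketch that counts and enumerates all n-mer probes built from the four natural bases. The names `library_size` and `probe_library` are hypothetical; libraries that include non-natural or universal bases would need a different alphabet and are not modeled here.

```python
from itertools import product

NATURAL_BASES = "ACGT"


def library_size(n, alphabet=NATURAL_BASES):
    """Number of distinct n-mer probes: len(alphabet) raised to the n power."""
    return len(alphabet) ** n


def probe_library(n, alphabet=NATURAL_BASES):
    """Yield every n-mer probe sequence over the given alphabet."""
    for bases in product(alphabet, repeat=n):
        yield "".join(bases)


if __name__ == "__main__":
    print(library_size(6))
    print(sum(1 for _ in probe_library(6)))
```

Both printed values agree with the 4096 unique six-mer probes stated above.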

The process of hybridization using probes 710, as depicted in FIG. 15, may begin by denaturing the biomolecule strand 1500, i.e., a basis for an analyte 125. Less than a milligram of DNA may be used. Preferably less than 100 micrograms of DNA are used. More preferably, less than 20 micrograms are used. Most preferably 1-10 micrograms of DNA are used. For re-sequencing applications or for applications that do not sequence an entire human genome, the amount of DNA used may be less than a nanogram. Denaturing is accomplished usually through the application of heat or chemicals, such that the hydrogen bonds between the two strands of the original double-stranded DNA are broken leaving two single strands of DNA whose bases are now available for hydrogen bonding. After the biomolecule 1500 has been denatured, a single-stranded probe 710 may be introduced to the biomolecule 1500 to locate portions of the biomolecule 1500 that have a base sequence that correlates to the sequence that is found in the probe 710. In order to hybridize the biomolecule 1500 with the probe 710, the denatured biomolecule 1500 and a plurality of the probes 710 having a known sequence are both introduced into a solution. The solution may be an ionic solution, such as a salt-containing solution. The solution conditions, such as salt concentration, ionic concentration, temperature and pH, are set to ensure stringent and accurate binding of the probe 710 to the complementary portion of the biomolecule 1500. The mixture may be agitated to facilitate binding of the probes 710 to the biomolecule 1500 strand along portions thereof that have a matched complementary sequence. Hybridization of the biomolecule 1500 with the probe 710 may be accomplished before the biomolecule 1500 is introduced into a sequencing apparatus or after the denatured biomolecule 1500 has been placed into a cis chamber 1615 of the apparatus described below with reference to FIG. 16. In the latter case, after the denatured biomolecule has been added to the cis chamber 1615, buffer solution containing probes 710 with a known sequence is also added to the cis chamber 1615 and allowed to hybridize with the biomolecule 1500 before the hybridized biomolecule is translocated.

An embodiment of a fluidic channel sequencing arrangement is graphically depicted in FIG. 16. One of skill in the art will readily recognize that a fluidic channel 105 is an elongated nanopore.

For the purpose of illustration, relatively short biomolecule strands 1500 with only two probes 710 are depicted. Long-stranded biomolecules 1500 may be translocated through the fluidic channel 105 to determine the location of the probes 710 attached thereto. The sequencing arrangement 1600 includes a fluidic channel 105 formed in an insulating material 1610. For example, the fluidic channel 105 may be formed in a solid-state material. Further, the fluidic channel 105 may have a diameter that allows the passage of double-stranded DNA and have dimensions of width and depth that are between approximately 1 nm and 5 μm, preferably between 2.3 nm and 100 nm, more preferably between 2.3 nm and 50 nm, e.g., 30 nm. The fluidic channel 105 is positioned between two fluid chambers, a cis chamber 1615, and a trans chamber 1615′, each of which is filled with a fluid. The cis chamber 1615 and the trans chamber 1615′ are in fluidic communication with one another via the channel 105 located in the insulating material 1610. A voltage is applied along the fluidic channel 105. This potential difference between the chambers 1615 on opposing sides of the fluidic channel 105 results in a measurable ionic current flow through the fluidic channel 105. In one embodiment, an electrode 1620 may be installed into each of the cis 1615 and trans 1615′ chambers to apply an electrical potential down the length of the fluidic channel 105. In an embodiment, the electrode in the cis chamber 1615 is a cathode, and the electrode in the trans chamber 1615′ is an anode.

The hybridized biomolecule strand 1500 with the probes 710 attached thereto is then introduced into the cis chamber 1615 in which the cathode is located. In an embodiment, the biomolecule 1500 is then driven or translocated through the fluidic channel 105 as a result of the applied voltage. As the biomolecule 1500 passes through the channel 105, electrodes 115 are monitored with respect to the electrical potential between pairs of electrodes or changes in a current applied between electrodes across the channel.

An analogous situation in which current is measured during translocation of a DNA strand through a nanopore is shown in FIG. 17. In the case of a nanopore 1750, the measured current runs parallel to the movement of the biomolecule 1500. For a fluidic channel system, a current-based sensor measures the current between electrodes on opposing sides of the fluidic channel 105. The current measurement is thus essentially perpendicular to the direction of movement of the biomolecule 1500. In the case of the nanopore, variations in current are the result of the relative diameter of the biomolecule 1500 that is passing through the nanopore 1750 at any given time. For example, the portions 1700 of the biomolecule 1500 that have probes 710 bound thereto are twice the diameter of the portions 1705 of the biomolecule 1500 that have not been hybridized and therefore lack probes 710. This relative increase in volume of the biomolecule 1500 passing through the nanopore 1750 causes a temporary interruption or decrease in the current flow therethrough, resulting in a measurable current variation as is depicted in the waveform 1710 at the bottom of the figure. As the portions 1700 of the biomolecule 1500 that include probes 710 pass through the nanopore 1750, the current is partially interrupted, forming a relative trough 1715 in the recorded current during passage of the bound portion 1700. Similarly, as the unhybridized portions 1705 of the biomolecule 1500 pass, the current remains relatively high, forming a peak 1720 in the measured current. The electrodes 1620 installed in the cis 1615 and trans 1615′ chambers detect and reflect these variations in the monitored current. Further, the current variations are measured and recorded as a function of time. As a result, the periodic interruptions or variations in current indicate where, as a function of relative or absolute position, the known probe 710 sequence has attached to the biomolecule 1500.
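One simple way to locate the troughs in a recorded current trace is to threshold the signal, as in the sketch below. The function name `find_troughs`, the sample values, and the threshold are all hypothetical; a practical implementation would also have to handle noise and baseline drift, which this sketch ignores.

```python
def find_troughs(times, currents, threshold):
    """Return (start, end) time intervals during which current < threshold."""
    troughs = []
    start = None
    last_below = None
    for t, current in zip(times, currents):
        if current < threshold:
            if start is None:
                start = t          # first sample of a new trough
            last_below = t         # most recent sample still inside the trough
        elif start is not None:
            troughs.append((start, last_below))
            start = None
    if start is not None:          # trace ended while still inside a trough
        troughs.append((start, last_below))
    return troughs


if __name__ == "__main__":
    # Made-up trace: two probe-bound portions pass at t = 2..3 and t = 6.
    times = list(range(10))
    currents = [10, 10, 6, 6, 10, 10, 5, 10, 10, 10]
    print(find_troughs(times, currents, threshold=8))
```

Each returned interval marks a span of time during which a probe-bound portion may have occupied the sensing region.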

In an analogous fashion, the potential measured between electrodes 115 in FIG. 16 will change as portions 1700 of the biomolecule 1500 that include probes 710 pass through the detector volumes delineated by electrodes 115. The periodic changes in potential or current indicate where, as a function of relative or absolute position, the known probe 710 sequence has attached to the biomolecule 1500.

The measurements obtained and recorded, as well as the time scale, may be input into a computer algorithm that maps the binding locations of the known probe 710 sequences along the length of the biomolecule 1500. Once the probe 710 locations are known, since the probe 710 length and composition are known, the sequence of the biomolecule 1500 along the portions 1700 to which the probes 710 were attached can be determined. This process can then be repeated using a different known probe 710. Further, the process can be repeated until every probe 710 within the library of n-mer probes has been hybridized with the biomolecule 1500 strand of interest. It can be seen in FIG. 18 that by repeating the process with different known probes 710′, 710″ and 710′″, the gaps in the portions of the biomolecule 1500 are gradually filled in with each subsequent hybridization and sequencing step until eventually the entire sequence of the biomolecule 1500 of interest is known.
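The gap-filling idea may be sketched as follows, under the strong simplifying assumption that each probe binding location is already known exactly, in bases, on the target strand. The function names, the example sequences, and the exact-position input are hypothetical; measured positions carry error, and the sketch does not attempt the alignment that such error would require.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Sequence of the strand that hybridizes to `seq`, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def fill_in_sequence(target_length, probe_maps, unknown="-"):
    """Write the complement of each bound probe at its mapped position.

    probe_maps: dict mapping a probe sequence (5'->3') to a list of 0-based
    start positions, in bases, on the target strand (5'->3').
    """
    sequence = [unknown] * target_length
    for probe, starts in probe_maps.items():
        site = reverse_complement(probe)
        for start in starts:
            for offset, base in enumerate(site):
                position = start + offset
                if sequence[position] not in (unknown, base):
                    raise ValueError("conflicting calls at position %d" % position)
                sequence[position] = base
    return "".join(sequence)


if __name__ == "__main__":
    # Made-up 9-base target with three rounds of hybridization.
    print(fill_in_sequence(9, {"AATC": [0]}))
    print(fill_in_sequence(9, {"AATC": [0], "TGTA": [3], "CCTG": [5]}))
```

With a single probe only the first four bases are resolved; adding the second and third probes fills the remaining gaps, and overlapping calls serve as a consistency check.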

Each subsequent hybridization and sequencing of the biomolecule 1500 may be accomplished in a variety of ways. For example, a plurality of nanopore assemblies, each sequencing copies of the same biomolecule of interest using different known probes, may be utilized simultaneously in a parallel fashion. Similarly, the same biomolecule may be repetitively hybridized and sequenced by passing it through a series of interconnected chambers. Finally, any combination of the above two processes may also be employed.

Detection of variations in electrical potential between the cis 1615 and trans chambers 1615′ as the hybridized biomolecule 1500 of interest passes through the nanopore 1750 may be accomplished in many different ways. For example, the variation in current flow as described above may be measured and recorded. Optionally, the change in capacitance as measured on the nanopore membrane itself may be detected and recorded as the biomolecule 1500 passes through the nanopore. Finally, the quantum phenomenon known as electron tunneling may be measured, whereby electrons travel in a perpendicular fashion relative to the path of travel taken by the biomolecule. In essence, as the biomolecule 1500 passes through the nanopore 1750, the probe 710 locations bridge the nanopore 1750, thereby allowing electrons to propagate across the nanopore in a measurable event. As the electrons propagate across the nanopore, the event is measured and recorded to determine the relative probe binding locations. The particular method by which the electrical variations are measured is not important, only that fluctuation in electrical properties is measured as they are impacted by the passing of the biomolecule through the nanopore.

The way in which the electrical potential varies, as a function of time, may depend on whether a single-stranded (un-hybridized) or double-stranded (hybridized) region of the biomolecule is passing through the nanopore 1750 and may be complicated. In the simplest scenario, the double-stranded region 1700 may suppress the current in comparison to the single-stranded region 1705, which may suppress the current in comparison to when no biomolecule 1500 is translocating. However, for small nanopore 1605 dimensions or low salt concentrations, the current may be augmented with the translocation of double-stranded portions 1700. In this case, the points of increased current may be used as an indicator of where the probes 710 are positioned along the biomolecule 1500.

The recorded changes in electrical potential across the nanopore 1750 as a function of time may then be processed using a computer and compiled using the sequences of the known probes 710 to reconstruct the entire sequence of the biomolecule 1500 strand of interest.

EXAMPLES

The following three constructive examples describe three possible scenarios in which the methods and devices described herein may be used to measure biopolymer lengths and to sequence biopolymers.

Example 1

A sensing device composed of two microfluidic chambers, one or more fluidic channels connecting the two microfluidic chambers, and two or more sensing electrodes disposed along the length of each fluidic channel, is filled with an ionic fluid. Typically, the fluid may be water that contains salt.

Multiple copies of a fragment of DNA of unknown length may be introduced into one of the microfluidic chambers that is connected to the fluidic channel that contains the multiple sensing electrodes. Macroscopic electrodes are used to electrophoretically drive the DNA strands from the microfluidic chamber into one or more fluidic channels. As the DNA enters the fluidic channel, it assumes a linear conformation. The degree to which it is linearized depends on a number of factors. Some of those factors are, e.g., the persistence length of the DNA strand, the temperature, the ionic conditions, and the width and depth of the fluidic channel.

The potential applied by the macroscopic electrodes causes the DNA strand to move down the length of the fluidic channel. As the fragment moves down the fluidic channel it passes through each of the sensing elements, i.e., detector volumes. When the leading edge of the DNA enters a detector volume, a change in some electrical characteristic such as cross channel current or potential between two sensing electrodes that define the detector volume may be recorded. As the trailing edge of the DNA strand exits the detector volume, the electrical response typically returns to the value which was observed before the DNA entered the volume. The magnitude of the electrical response depends on the experimental set-up; preferably, the electrical response is equal to at least 3 times the magnitude of the root mean square noise for the system.
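The preferred signal criterion may be expressed as a simple check, sketched below. The function names are hypothetical, and the sketch assumes that the root mean square noise is computed as the RMS deviation of baseline samples (no DNA present) about their mean; the source does not specify how the noise is estimated.

```python
import math


def rms_noise(baseline):
    """RMS deviation of baseline samples about their mean (an assumed definition)."""
    mean = sum(baseline) / len(baseline)
    return math.sqrt(sum((x - mean) ** 2 for x in baseline) / len(baseline))


def is_detectable(baseline, response, factor=3.0):
    """True if the response departs from the mean baseline by >= factor x RMS noise."""
    mean = sum(baseline) / len(baseline)
    return abs(response - mean) >= factor * rms_noise(baseline)


if __name__ == "__main__":
    baseline = [1.0, -1.0, 1.0, -1.0]    # made-up samples; RMS noise is 1.0
    print(is_detectable(baseline, 3.5))  # above the 3x criterion
    print(is_detectable(baseline, 2.5))  # below the 3x criterion
```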

When the DNA enters a detector volume, an electrical signal is recorded. The signal is composed of a time stamp and an indication of which detector had changes in potential or other electrical property. The value of the electrical property may also be recorded. The value may be subtracted from the background signal or may be an absolute value. A table may be generated by a computer that lists all detector volume responses and the time stamp for each response. A computer program may subsequently determine when there has been coincident detection by determining when two or more detector volumes detect the presence of the DNA with the same time stamp. Each time that two or more detector volumes detect DNA at the same time, the time and affected detector volumes are noted. The distance between the affected detector volumes may also be recorded. Following complete translocation of the DNA fragment through all of the detector volumes in the fluidic channel, a computer program may be used to determine which of the recorded coincident detection events involved the two detector volumes with the largest distance between them.

The lower limit of the length of the DNA may be determined by calculating the distance between the two maximally separated detector volumes that indicate the presence of the DNA fragment at the same time. The upper limit of the length of the DNA fragment may be determined by calculating the distance between the closest two detector volumes that do not sense the DNA fragment at the same time during the experiment. The difference between these two distances defines the error in the measurement. Multiple copies of the same fragment may be observed independently in the same or multiple fluidic channels during the experiment.
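The coincidence analysis and the resulting length bounds may be sketched as follows. The data layout (a list of time stamp and detector pairs, and one reference position per detector volume) and the function names are assumptions for illustration; in particular, treating the distance between two detector volumes as the difference of single reference positions is a simplification.

```python
from collections import defaultdict
from itertools import combinations


def coincident_pairs(events):
    """Set of detector pairs that reported DNA with the same time stamp."""
    by_time = defaultdict(set)
    for time_stamp, detector in events:
        by_time[time_stamp].add(detector)
    pairs = set()
    for detectors in by_time.values():
        pairs.update(combinations(sorted(detectors), 2))
    return pairs


def length_bounds(positions, events):
    """Return (lower, upper) bounds on the fragment length.

    lower: largest separation between two coincident detector volumes.
    upper: smallest separation between two detector volumes that were never
           coincident (None if every pair was coincident).
    """
    seen = coincident_pairs(events)
    lower, upper = 0, None
    for a, b in combinations(sorted(positions), 2):
        separation = abs(positions[a] - positions[b])
        if (a, b) in seen:
            lower = max(lower, separation)
        elif upper is None or separation < upper:
            upper = separation
    return lower, upper


if __name__ == "__main__":
    # Hypothetical detector positions in nm and a made-up event table.
    positions = {"A": 0, "B": 100, "C": 400, "D": 600}
    events = [(0, "A"), (1, "A"), (1, "B"), (2, "B"),
              (4, "C"), (5, "C"), (5, "D"), (6, "D")]
    print(length_bounds(positions, events))
```

In this made-up run the fragment bridges detector volumes separated by 200 nm but not those separated by 300 nm, so the length lies between those two values and the measurement error is 100 nm.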

Depending on how the distance between detector volumes is determined, a correction factor may be applied to the measured length in order to calculate the true length of the DNA fragment. For example, the distance between detector volumes may be measured by optical or electron microscopy during or after fabrication of the device. In this case, the length of the DNA calculated by the separation of the detector volumes does not take into account the incomplete linearization of the DNA in the fluidic channel. The extent of linearization may be estimated from literature values for linearization determined by optical methods on DNA in fluidic channels. If an estimated extent of linearization is 75%, then the measured length is divided by 0.75 to give the actual length.

The extent of linearization may also be determined by passing a DNA fragment of known length down the fluidic channel under the same conditions of temperature, pH, and ionic strength as the fragment of unknown length. The measured length may be used to calculate a correction factor as follows:

the measured length/the known length=correction factor.

For instance, if a fragment of DNA whose known length is 143 nm is placed in the fluidic channel and the measured length of the DNA fragment is 100 nm, then the correction factor is 0.70. This indicates that under this set of conditions and with this fluidic channel device, the extent of linearization is 70%. If an unknown fragment is measured in the same device under the same conditions and the measured length is 400 nm, then the actual length is 400 nm/0.70=571 nm. It is contemplated that single-stranded DNA and double-stranded DNA may be calibrated separately.
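The calibration arithmetic above may be reproduced with the short sketch below. The function names are hypothetical, and the sketch rounds the correction factor to two decimal places only to match the worked figures.

```python
def correction_factor(measured_length, known_length):
    """Extent of linearization: measured length divided by known length."""
    return measured_length / known_length


def corrected_length(measured_length, factor):
    """Actual length: measured length divided by the correction factor."""
    return measured_length / factor


if __name__ == "__main__":
    factor = round(correction_factor(100.0, 143.0), 2)  # calibration fragment, nm
    print(factor)
    print(round(corrected_length(400.0, factor)))       # unknown fragment, nm
```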

Rather than determining the distance between detector volumes by microscopy, the device may also be calibrated with a series of DNA fragments of known length. The fragments preferably span enough different lengths to calibrate all detector volumes that may be used in the experiment to measure the length of the unknown fragment. If DNA fragments of a known length are used to calibrate the device, no further correction factors that take into account the extent of linearization may be needed. For instance, a DNA fragment of known length 150 nm may pass through the detector volumes in the fluidic channel; the two detector volumes that are maximally separated and detect the DNA fragment at the same time may be, e.g., detector volumes 700B and 700D. When an unknown fragment passes through the fluidic channel and is also detected by, e.g., detector volumes 700B and 700D at the same time but not by any other pairs of detector volumes at a larger distance, then the length of the unknown fragment is at least 150 nm.

A mixture containing several different length fragments of DNA may be introduced into the fluidic channel. Detection in the fluidic channel allows the determination of lengths for each of the fragments in the mixture.

Example 2

A target DNA strand of known or unknown sequence may be denatured. Denaturation of the duplex DNA is typically accomplished through the application of heat or chemicals, such that the hydrogen bonds between paired strands are broken. The denatured DNA sample may be incubated with a probe of known sequence and base length or divided for incubation with multiple probes, each with their own specific recognition sequences on the target DNA. In order to hybridize the probe or probes to their recognition sequence or sequences, the conditions for the incubation are chosen such that the probe or probes bind to the known specific recognition site in preference to other sites or mismatch sites. The conditions are also chosen so that more of the probe binding sites on the denatured DNA strands are bound to a probe than unbound. The solution may be a buffered ionic solution. The solution may be agitated to facilitate binding of the probes. The temperature of the solution may be varied during the course of the incubation. For instance, the incubation may be slowly cooled over the course of the hybridization.

Once the denatured target DNA has been hybridized with a probe or probes, the sample may be introduced into a microfluidic chamber at one end of the fluidic channel device. The fluidic channel device may be filled with an ionic solution, e.g., a salt solution. The solution may also be buffered. The excess probe or probes may be removed prior to the introduction of the sample into the microfluidic chamber. Gel filtration is one method for removing short probes from a longer strand of DNA. Alternatively, other commercially available purification methods may be used. Once the target DNA strand with hybridized probes has been introduced into a microfluidic chamber, a potential is applied via macroscopic electrodes to drive the DNA from the microfluidic chamber into one or more fluidic channels.

The target DNA upon entering the fluidic channel typically assumes a linearized conformation. The narrower the fluidic channel, the more linearized the DNA is forced to become. The voltage applied to the macroscopic electromotive electrodes electrophoretically drives the DNA down the fluidic channel. As the DNA and hybridized probes move down the fluidic channel they enter each of the detector volumes disposed along the fluidic channel. In this example, each detector volume includes two sensing electrodes that determine the outer boundaries of the detector volume. Each pair of sensing electrodes may be connected to a device that measures the potential between the two sensing electrodes. The source of the potential difference between the sensing electrodes is the potential applied to the macroscopic electrodes that are disposed at the ends of the fluidic channel. The value of the potential difference typically depends on the device geometry with respect to the size of the fluidic channel, the potential applied to the macroscopic electrodes, the distance between the sensing electrodes in a pair, and the conductivity of the fluid-filled volume between the two sensing electrodes.

In the absence of DNA, the detector volume may contain only the ionic solution and have a baseline potential difference measured between the two sensing electrodes that define the detector volume. As DNA enters the detector volume, the potential measured between the two sensing electrodes changes because the DNA has a conductivity different from that of the ionic solution. When DNA enters the detector volume, the conductivity of the fluidic channel between the two sensing electrodes is typically reduced with respect to the conductivity when only ionic fluid is present between the sensing electrodes. As DNA enters a detector volume, the change in potential or some other electrical property is measured. When a portion of the DNA that also has a probe hybridized thereto enters the detector volume, the potential changes further.

The target DNA, which has two or more positions at which a probe or probes have hybridized, is electrophoretically driven down the fluidic channel and through each of the detector volumes in the fluidic channel. As the DNA moves down the fluidic channel, the locations on the DNA to which the probes are hybridized also move through each of the detector volumes in turn. When a probe on the target DNA enters a detector volume, an electrical signal is recorded. The electrical signal is composed of a time stamp and an indication of which detector had changes in potential or other electrical property. The value of the electrical property may also be recorded. The recorded value may be background-subtracted or may be an absolute value. A table may be generated by a computer that lists all detector volume responses and the time stamp for each response. A computer program may subsequently determine occurrences of coincident detection by determining when two or more detector volumes detect the presence of a probe on the target DNA with the same time stamp. Each time that two or more probes are disposed in detector volumes at the same time, the time and affected detector volumes are noted.
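
The coincidence search described above may be sketched as follows. This is a minimal illustration only: the table layout (a list of time stamp and detector identifier pairs), the function name `find_coincidences`, and the sample values are assumptions made for the example and are not taken from the described device.

```python
from collections import defaultdict


def find_coincidences(events):
    """Return time stamps at which two or more detector volumes responded.

    `events` is an iterable of (time_stamp, detector_id) tuples, a
    hypothetical layout for the table of detector volume responses.
    """
    by_time = defaultdict(set)
    for time_stamp, detector_id in events:
        by_time[time_stamp].add(detector_id)
    # Keep only time stamps shared by at least two detector volumes.
    return {
        t: sorted(detectors)
        for t, detectors in by_time.items()
        if len(detectors) >= 2
    }


# Hypothetical table of probe detections: (time stamp, detector volume).
events = [(10, "A"), (10, "C"), (12, "B"), (15, "A"), (15, "B"), (15, "D")]
coincidences = find_coincidences(events)
```

In practice, time stamps from a sampled signal would likely be compared within a tolerance window rather than for exact equality; exact matching is used here only to keep the sketch short.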

The distance between any two probes may be calculated from the distance between pairs of detector volumes that show coincident detection for that pair of probes. For instance, if two detector volumes separated by 100 nm show coincident signals for probes, then a distance of 100 nm between probes may be recorded. The greater the number of different distances that exist between pairs of detector volumes in the fluidic channel, the more efficient the device may be in determining the distance between two probes.
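
As a minimal sketch of this lookup, the separation between coincident detector volumes may be read from a table of known detector positions. The position values, the detector labels, and the function name `probe_distances` are hypothetical and serve only to reproduce the 100 nm case mentioned above.

```python
from itertools import combinations


def probe_distances(coincident_detectors, positions_nm):
    """Return the separation (nm) of every pair of coincident detector volumes.

    `positions_nm` maps a detector identifier to its known position along
    the fluidic channel; both names are assumptions for this example.
    """
    return {
        (a, b): abs(positions_nm[b] - positions_nm[a])
        for a, b in combinations(sorted(coincident_detectors), 2)
    }


# Hypothetical detector positions along the channel, in nanometers.
positions_nm = {"A": 0.0, "B": 100.0, "C": 300.0}
distances = probe_distances(["A", "B"], positions_nm)
```

The value obtained this way is the measured distance; any correction for incomplete linearization would be applied afterwards.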

In order to determine the distance between two detected probes, it is generally necessary to know the distance between each pair of detector volumes in which the probes are detected. This may be accomplished during fabrication of the device, at which time the position of the detector volumes may be noted. The distances may also be determined after fabrication by, e.g., electron microscopy. Finally, the distances between detector volumes may be determined by calibrating the device with known lengths of DNA or DNA having probes hybridized at known positions.

If the latter technique (calibration with a biopolymer) is used, no further correction of the measured distance between probes on the target DNA needs to be made. However, if the distance between detector volumes is determined during fabrication or by electron or optical microscopy, then a further correction to the determined distance between two probes is typically needed. This is a result of the fact that the target DNA and associated probes may not be perfectly linearized by the fluidic channel. For instance, the DNA may only be linearized by 70% in a 100 nm fluidic channel. If two detector volumes that are separated by 100 nm record coincident signals indicating the presence of probes, the recorded distance between the probes is 100 nm. However, the distance is preferably corrected for the non-perfect linearization of the DNA. In the case where the linearization is only 70%, the calculated distance between the two probes is 100 nm/0.70=143 nm.

The amount of linearization of the biopolymer may be determined from literature values or it may be obtained by calibrating the device under the conditions under which the experiment is to be run. A piece of DNA of known length may be placed in the fluidic channel and the two detector volumes having the greatest separation and recording signals indicating the presence of the DNA at the same time may be used to determine the measured length of the DNA. The correction factor may be determined by dividing the measured length by the known length. If the conditions such as ionic solution, temperature, and fluidic channel dimensions remain constant, the same correction factor may be used for every detector volume in the device. Thus, a piece of DNA that has a length of 500 nm may be placed in a device and measured to have a length of 400 nm. The correction factor for that device under the experimental conditions is 400 nm/500 nm=0.80. The correction factor of 0.80 may be used for every subsequent measurement in the device that is made under the same experimental conditions, including the calculation of distances between probes.
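
The calibration and correction arithmetic described above may be sketched as follows, assuming the correction factor is the ratio of measured length to known length and that a measured distance is divided by that factor. The function names are illustrative assumptions.

```python
def correction_factor(measured_nm, known_nm):
    """Correction factor = measured length / known length of a reference DNA."""
    if known_nm <= 0:
        raise ValueError("known length must be positive")
    return measured_nm / known_nm


def corrected_length(measured_nm, factor):
    """Divide a measured length by the correction factor to estimate actual length."""
    if factor <= 0:
        raise ValueError("correction factor must be positive")
    return measured_nm / factor


# Reference DNA of 500 nm measured as 400 nm gives a factor of 0.80.
factor = correction_factor(400.0, 500.0)

# Two probes detected 100 nm apart at 70% linearization: about 143 nm.
actual_nm = corrected_length(100.0, 0.70)
```

The same factor is reused only for measurements made under the same ionic solution, temperature, and channel dimensions as the calibration run.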

Example 3

A sensing device composed of two microfluidic chambers, one or more fluidic channels connecting the two microfluidic chambers, and two or more sensing electrodes disposed along the length of each fluidic channel (e.g., a nanochannel), is filled with an ionic fluid. Typically, the fluid may be water that contains buffering agents or salt or both buffering agent and salt.

The sensing electrodes in the fluidic channel are disposed such that they contain two or more different volumes between each pair. As shown in FIG. 9 c, the distances may be determined by a Golomb ruler. Alternatively, any spacing that results in at least two different lengths may be used. One such arrangement is shown in FIG. 10. Preferably, many different lengths between sensing electrodes are represented.
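
A Golomb ruler is a set of marks in which every pairwise distance is distinct, so each pair of electrodes yields a different length. The check below is a minimal sketch; the mark positions are in arbitrary units and are a textbook example chosen for illustration, not the electrode layout of FIG. 9 c.

```python
from itertools import combinations


def pairwise_distances(marks):
    """Return the sorted list of distances between every pair of marks."""
    return sorted(b - a for a, b in combinations(sorted(marks), 2))


def is_golomb_ruler(marks):
    """True if no two pairs of marks are separated by the same distance."""
    distances = pairwise_distances(marks)
    return len(distances) == len(set(distances))


# Hypothetical electrode positions in arbitrary units.
marks = [0, 1, 4, 6]
all_distinct = is_golomb_ruler(marks)
available_lengths = pairwise_distances(marks)
```

With four marks at 0, 1, 4, and 6, the six electrode pairs give six different lengths, whereas four evenly spaced marks would give only three.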

A target DNA strand is hybridized with a probe or a collection of different probes each of which preferentially binds a unique DNA sequence. The target, hybridized with probes, may be introduced into a microfluidic chamber that is connected to the fluidic channel that contains multiple sensing electrodes. Macroscopic electrodes are used to electrophorese the DNA strands from the microfluidic chamber into one or more fluidic channels. Alternatively, the DNA may be pumped through the fluidic channel by pressure-induced fluid flow. As the DNA enters the fluidic channel it assumes a linear conformation. The degree to which it is linearized is dependent on a number of factors. Some of those factors are the persistence length of the DNA strand, the temperature, the ionic conditions, and the width and depth of the fluidic channel.

As the DNA fragment moves down the fluidic channel it passes through each of the detector volumes. When the leading edge of the DNA enters a detector volume, a change in some electrical characteristic such as the potential between two sensing electrodes that are associated with the detector volume may be recorded. If the DNA strand is as long as or longer than the distance between the two sensing electrodes, then the electrical signal may reach a constant maximum value. If the DNA strand is shorter than the distance between the two electrodes, the electrical signal may be a fraction of the maximum signal equal to the ratio of the DNA strand length to the distance between the two sensing electrodes. The maximum value may be predicted from the cross sectional area of the DNA or other analyte or may be determined with a reference DNA strand of known length that is longer than the detector volume.
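
The proportional relationship described above may be sketched as follows, assuming the signal is expressed as a change relative to baseline and scales linearly with the fraction of the detector volume occupied. The function name and the numeric values are hypothetical.

```python
def length_from_signal(signal, max_signal, detector_length_nm):
    """Estimate strand length from the fraction of the maximum signal.

    Returns None when the signal has reached the maximum, because the
    strand then fills the detector volume and only a lower bound
    (the detector length) is known.
    """
    if max_signal <= 0:
        raise ValueError("maximum signal must be positive")
    fraction = signal / max_signal
    if fraction >= 1.0:
        return None
    return fraction * detector_length_nm


# A signal at half of the maximum in a 200 nm detector volume.
estimated_nm = length_from_signal(0.5, 1.0, 200.0)
```

The linear scaling is itself an assumption of the sketch; a measured calibration curve could be substituted for it.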

When a portion of the DNA strand that has a probe hybridized enters the detecting volume, the electrical characteristic may change further. If the hybridized probe is longer than the distance between the two sensing electrodes associated with the detector volume, a new maximum value may be obtained. If the probe is shorter than the distance between the two sensing electrodes the new signal may be proportional to the maximum expected signal in direct relation to the ratio of the probe length to the distance between the two sensing electrodes as described in the previous paragraph. If a portion of the target DNA containing two hybridized probes is contained by a single detector volume, the electrical characteristic may change by an amount that is the sum of the changes for each of the hybridized probes. The two hybridized probes 710″ and 710′″ in FIG. 11 a are contained in a single sensing element as indicated by the two separate and additive electrical changes. When two or more probes are contained in a single sensing element the distance between the probes is less than the length of the detector volume. In FIG. 11 a, the distance between probe 710″ and probe 710′″ is less than the distance between electrodes (115C, 115F).

The shortest distance between two sensing electrodes that contains two probes places an upper limit on the measured distance between the two probes. The longest distance between two sensing electrodes that does not encompass the two probes places a lower limit on the measured distance between the probes. The difference between these two distances defines the error in the measurement. Multiple copies of the same fragment may be observed independently in the same or multiple fluidic channels during the experiment.
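
The bracketing described above may be sketched as follows. The two input lists (electrode spacings that did and did not contain both probes) and the function name are assumptions made for the example.

```python
def probe_distance_bounds(spans_containing_both_nm, spans_not_containing_both_nm):
    """Return (lower, upper, error) for the distance between two probes.

    upper: shortest electrode spacing that contained both probes.
    lower: longest electrode spacing that did not encompass both probes.
    error: difference between the two bounds.
    """
    if not spans_containing_both_nm or not spans_not_containing_both_nm:
        raise ValueError("both lists must contain at least one spacing")
    upper = min(spans_containing_both_nm)
    lower = max(spans_not_containing_both_nm)
    return lower, upper, upper - lower


# Hypothetical spacings in nanometers.
lower_nm, upper_nm, error_nm = probe_distance_bounds([300.0, 500.0], [100.0, 200.0])
```

A larger set of distinct electrode spacings narrows the gap between the two bounds, and repeated observations of the same fragment may narrow it further.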

Depending on how the distance between sensing electrodes was determined, a correction factor may be applied to the measured distance in order to calculate the true distance between the probes. The distance between sensing electrodes may be measured by optical or electron microscopy during or after fabrication of the device. In this case, the length of the DNA calculated by the separation of the sensing electrodes does not take into account the incomplete linearization of the DNA in the fluidic channel. The extent of linearization may be estimated from literature values for linearization determined by optical methods on DNA in fluidic channels. If the estimated extent of linearization is 75% then the measured length is divided by 0.75 to give the actual length.

The extent of linearization may also be determined by passing a DNA fragment of known length down the fluidic channel under the same conditions of temperature, pH, and ionic strength as the fragment of unknown length. The measured length may be used to calculate a correction factor as follows: the measured length/the known length=correction factor. For instance, if a fragment of DNA whose known length is 143 nm is placed in the fluidic channel and the measured length of the DNA fragment is 100 nm, then the correction factor is 0.70. This indicates that under this set of conditions and with this fluidic channel device, the extent of linearization is 70%. If an unknown fragment is measured in the same device under the same conditions and the measured length is 400 nm, then the actual length is 400 nm/0.70=571 nm.

The dynamic range of the detector may be determined by the number of different distances that are measured between the sensing electrodes. The resolution may be determined by the amount of variation in the different distances and by the accuracy with which the distance between each pair of sensing electrodes is determined.

The designs described herein merge nanopore and nanofluidic channel technologies and decouple the driving electrophoretic force from the detected signal. By using voltage sensing and by fabricating voltage amplifiers directly on the substrate where the nanoscale electrodes are placed, the device may operate at higher frequencies than has been possible with previous geometries.

The described embodiments of the invention are intended to be merely exemplary and numerous variations and modifications will be apparent to those skilled in the art. All such variations and modifications are intended to be within the scope of the present invention as defined in the appended claims. 

1. A device for determining a length of an analyte by detecting electrical signals, the device comprising: a fluidic channel defined in a substrate; and a plurality of sensing electrodes disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel, wherein (i) said sensing electrodes are configured for connection to a measurement tool for capturing said electrical signals corresponding to said detector volumes, (ii) the fluidic channel is a nanochannel or a microchannel, (iii) relative positions of the sensing electrodes are known, and (iv) the captured electrical signals in conjunction with the relative positions of the sensing electrodes indicate the length of the analyte.
 2. The device of claim 1, further comprising: a data collection device configured for recording electrical signals captured by the measurement tool as a function of time; and a computer in electrical communication with the data collection device, the computer programmed to determine which detector volumes record a change in electrical signal at the same time.
 3. The device of claim 1, further comprising an electronic circuit configured to output a signal only when two electrical signals corresponding to two detector volumes change at the same time.
 4. The device of claim 1, further comprising a pair of electromotive electrodes disposed at a first end and a second end of the fluidic channel.
 5. The device of claim 4, wherein the pair of electromotive electrodes disposed at the first and second ends of the channel comprises macroscopic electrodes arranged to generate a constant, changing, or oscillating electrophoretic force in the fluidic channel for translocation of the analyte disposed therein.
 6. The device of claim 1, wherein the device is configured such that positive pressure drives the analyte through the fluidic channel.
 7. The device of claim 1, wherein the device is configured such that a chemical gradient drives the analyte through the fluidic channel.
 8. The device of claim 1, wherein the substrate comprises a material selected from the group consisting of silicon, silicon dioxide, fused silica, and gallium arsenide.
 9. The device of claim 1, wherein at least one of the sensing electrodes comprises a material selected from the group consisting of platinum, gold, chrome, titanium, silver chloride, silver, and graphene.
 10. The device of claim 1, wherein a sensing element corresponding to a given detector volume comprises two sensing electrodes disposed on opposing sides of the fluidic channel.
 11. The device of claim 1, wherein a sensing element corresponding to a given detector volume comprises two sensing electrodes disposed on a first side of the fluidic channel.
 12. The device of claim 1, wherein a sensing element corresponding to a given detector volume comprises two sensing electrodes transversing the fluidic channel.
 13. The device of claim 1, wherein a sensing element corresponding to a given detector volume comprises a first sensing electrode transversing the fluidic channel, and a second sensing electrode on a side of the fluidic channel.
 14. The device of claim 1, wherein the measurement tool comprises at least one of a voltmeter, an ammeter, or a field-effect transistor.
 15. The device of claim 1, further comprising a plurality of fluidic channels defined in the substrate.
 16. The device of claim 1, further comprising a voltage amplifier configured to amplify the two or more electrical signals.
 17. The device of claim 1, wherein the fluidic channel has a width selected from a range of 1 nm to 5 μm.
 18. The device of claim 1, wherein the fluidic channel has a depth selected from a range of 1 nm to 5 μm.
 19. The device of claim 1, wherein the fluidic channel has a length selected from a range of 1 μm to 10 cm.
 20. The device of claim 1, wherein (i) the device comprises at least three pairs of sensing electrodes, and (ii) a distance between detector volumes corresponding to a first and second pair of sensing electrodes is unequal to a distance between detector volumes corresponding to the second and third pair of electrodes.
 21. The device of claim 1, wherein (i) the device comprises at least three pairs of sensing electrodes, (ii) a detector volume corresponding to a first pair of sensing electrodes is unequal to a detector volume corresponding to a second pair of sensing electrodes, and (iii) a detector volume corresponding to the second pair of sensing electrodes is unequal to a detector volume corresponding to a third pair of sensing electrodes.
 22. (canceled)
 23. A method for determining a length of an analyte, the method comprising the steps of: disposing the analyte in a fluidic channel; applying a potential along the fluidic channel; translocating the analyte from a first end of the fluidic channel to a second end of the fluidic channel; detecting two or more electrical signals as the analyte moves through the fluidic channel, said two or more electrical signals corresponding to two or more detector volumes of the fluidic channel, said two or more electrical signals being detected using a plurality of sensing electrodes disposed along the length of the fluidic channel; and determining the length of the analyte by analyzing the two or more detected electrical signals, wherein the fluidic channel is a nanochannel or a microchannel.
 24. The method of claim 23, wherein applying the potential along the fluidic channel generates an electrophoretic force therein.
 25. The method of claim 23, wherein translocating the analyte comprises using a chemical gradient.
 26. The method of claim 23, wherein translocating the analyte comprises using a pressure differential.
 27. The method of claim 23, wherein determining the length of the analyte comprises identifying at least two detector volumes in which the analyte is sensed at a given time, and determining a distance between sensing electrodes corresponding to said at least two detector volumes.
 28. The method of claim 23, wherein an amount that the analyte partially fills the detector volume is determined by comparing the electrical signal caused by the analyte to a maximum signal caused by a sample biopolymer long enough to fill the detector volume entirely.
 29. The method of claim 23, further comprising applying a correction factor to a measured length to determine an actual length of the analyte.
 30. The method of claim 23, wherein the analyte comprises a biopolymer selected from the group consisting of deoxyribonucleic acids, ribonucleic acids, and polypeptides.
 31. The method of claim 30, wherein the biopolymer is a single-stranded molecule.
 32. The method of claim 23, wherein the analyte is at least partially hybridized, and the detected electrical signals indicate the presence of a probe bound to the analyte.
 33. A method for sequencing a biopolymer, the method comprising: preparing an analyte by hybridizing a first plurality of probes with the biopolymer such that said first plurality of probes attaches to portions of the biomolecule to produce a partially hybridized biomolecule; disposing the analyte in a fluidic channel; applying a potential along the fluidic channel; translocating the analyte from a first end of the fluidic channel to a second end of the fluidic channel; detecting two or more electrical signals as the analyte moves through the fluidic channel, said two or more electrical signals corresponding to two or more detector volumes of the fluidic channel, said two or more electrical signals being detected by using a plurality of sensing electrodes disposed along the length of the fluidic channel, the detected electrical signals indicating locations of the hybridized probes along the biopolymer; analyzing the electrical signals to determine in which detector volumes probes bound to the biomolecule are located; determining at least a portion of the sequence of the biopolymer using a distance between sensing electrodes corresponding to said detector volumes in which the probes are located, wherein the fluidic channel is a nanochannel or a microchannel.
 34. The method of claim 33, wherein applying the potential along the fluidic channel generates an electrophoretic force therein.
 35. The method of claim 33, wherein translocating the analyte comprises using a chemical gradient.
 36. The method of claim 33, wherein translocating the analyte comprises using a pressure differential.
 37. The method of claim 33, further comprising determining a distance between probes using a coincident response of electrical signals corresponding to two or more detector volumes.
 38. The method of claim 37, wherein a spacing between sensing electrodes is used to determine a maximum distance between probes.
 39. The method of claim 37, wherein a spacing between sensing electrodes is used to determine a minimum distance between probes.
 40. The method of claim 33, wherein the electrical signal initially changes when the biopolymer moves through a detector volume associated with two sensing electrodes and further changes when a portion of the biopolymer comprising a hybridized probe moves through the detector volume.
 41. (canceled)
 42. The method of claim 33, further comprising hybridizing a second plurality of probes with the biopolymer and repeating the detecting, analyzing, and determining steps with said second plurality of probes.
 43. The method of claim 42, further comprising using the two or more electrical signals to detect and record complexed and uncomplexed regions of the biopolymer to create a first probe map of the first plurality of probes and a second probe map of the second plurality of probes, the first probe map and the second probe map respectively comprising information about the relative positions of the hybridized first and second plurality of probes.
 44. The method of claim 43, further comprising determining a candidate sequence by ordering at least two probe sequences using at least one of positional information or a combination of overlapping probe binding sequences and positional information.
 45. (canceled)
 46. The method of claim 45, further comprising determining a candidate sequence by ordering at least two probe sequences using at least one of (i) positional information and parameters relating to the error in positional information or (ii) a combination of overlapping sequences of the probe molecules and positional information and the error in positional information.
 47. (canceled)
 48. The method of claim 47, wherein preparing the analyte comprises contacting the biopolymer with a first probe having a first probe specificity for recognition sites of the biopolymer to form a first plurality of local ternary complexes, the first probe having a first known recognition site sequence.
 49. The method of claim 48, further comprising using the electrical signal to determine positional information of the first plurality of local ternary complexes.
 50. The method of claim 48, wherein preparing the analyte further comprises contacting the biopolymer with a second probe having a second probe specificity for recognition sites of the biopolymer to form a second plurality of local ternary complexes, the second probe having a second known recognition site sequence.
 51. The method of claim 50, further comprising aligning positional information of at least the first and second plurality of local ternary complexes to determine a sequence of the biopolymer.
 52. A system for determining a length of an analyte, the system comprising an analyzing module that determines the length of the analyte based at least in part on a plurality of electrical signals captured by the device of claim 1.
 53. An apparatus for determining a length of an analyte, the apparatus comprising: (a) a memory that stores code defining a set of instructions; and (b) a processor that executes the instructions thereby to determine the length of the analyte from two or more detected electrical signals captured by the device of claim 1.
 54. A system for sequencing a biopolymer, the system comprising: (a) a fluidic channel defined in a substrate; (b) a plurality of sensing electrodes disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel, wherein the fluidic channel is configured such that a biopolymer with at least a first plurality of probes attached thereto may pass therethrough, wherein said sensing electrodes are configured for connection to a measurement tool for capturing said electrical signals corresponding to said detector volumes as said biopolymer passes through the fluidic channel, wherein the fluidic channel is a nanochannel or a microchannel, and wherein relative positions of the sensing electrodes are known; and (c) an analyzing module that determines at least a portion of the sequence of the biopolymer based at least in part on a plurality of the captured electrical signals.
 55. (canceled)
 56. An apparatus for sequencing a biopolymer, the apparatus comprising: (a) a fluidic channel defined in a substrate; and (b) a plurality of sensing electrodes disposed along a length of the fluidic channel for detection of two or more electrical signals corresponding to two or more detector volumes disposed along the fluidic channel, wherein the fluidic channel is configured such that a biopolymer with at least a first plurality of probes attached thereto may pass therethrough, wherein said sensing electrodes are configured for connection to a measurement tool for capturing said electrical signals corresponding to said detector volumes as said biopolymer passes through the fluidic channel, wherein the fluidic channel is a nanochannel or a microchannel, wherein relative positions of the sensing electrodes are known, and wherein the captured electrical signals in conjunction with the relative positions of the sensing electrodes indicate at least a portion of the sequence of the biopolymer.
 57. A device for voltage sensing of analytes, the device comprising: a fluidic channel defined in a substrate; a first and a second pair of sensing electrodes disposed in the channel for sensing voltage therein, the second pair of sensing electrodes being disposed distal to the first pair of sensing electrodes, and each pair of sensing electrodes comprising a first and a second electrode disposed at two discrete locations along a length of the channel; and a pair of electromotive electrodes disposed at a first end and a second end of the channel for applying a potential along the channel, wherein the channel comprises a nanochannel or a microchannel.
 58. The device of claim 57, wherein the substrate comprises a material selected from the group consisting of silicon, silicon dioxide, and fused silica. 